Data mining basics and the KDD process

Why do we need data mining?

              Nowadays, large amounts of data are created and collected regularly. Analysing these data is important because it yields valuable information. The purpose of data mining is therefore to turn data into knowledge.

What is data mining?

               Data mining is the process of discovering interesting patterns and knowledge from large amounts of data. Sources of data include databases, data warehouses, the web, the internet, social media, and system-generated data such as log files.
               Data mining is also known as knowledge mining from data, knowledge extraction, data or pattern analysis, data archaeology, data dredging, and knowledge discovery from data (KDD). The KDD process consists of the following steps:
             Data cleaning
             Data integration
             Data selection
             Data transformation
             Data mining
             Pattern evaluation
             Knowledge presentation

Data cleaning

            System-generated data and data stored in databases contain unnecessary data and errors. In this first phase we remove that noise so that the results are accurate.
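
The cleaning step can be sketched in a few lines of Python. This is only a minimal illustration: the records, the field names, and the plausible age range are made up for the example, and real cleaning rules depend on the data at hand.

```python
# Hypothetical raw records; None marks a missing value.
raw_records = [
    {"id": 1, "age": 34, "city": "Chennai"},
    {"id": 2, "age": None, "city": "Madurai"},
    {"id": 1, "age": 34, "city": "Chennai"},  # exact duplicate
    {"id": 3, "age": 420, "city": "Salem"},   # implausible age, treated as noise
    {"id": 4, "age": 27, "city": "Salem"},
]


def clean(records, min_age=0, max_age=120):
    """Drop exact duplicates, rows with missing values, and out-of-range ages."""
    seen = set()
    cleaned = []
    for row in records:
        key = tuple(sorted(row.items()))
        if key in seen:
            continue  # duplicate of a row we already looked at
        seen.add(key)
        if any(value is None for value in row.values()):
            continue  # missing value
        if not (min_age <= row["age"] <= max_age):
            continue  # assumed plausibility rule for this example
        cleaned.append(row)
    return cleaned


cleaned_records = clean(raw_records)
print(cleaned_records)
```

Only the records with id 1 and id 4 survive here. Dropping rows is the simplest choice; filling in missing values is another option that this sketch does not cover.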

Data integration

            Data mining often has to draw on several kinds of data sources. All the data sources are therefore converted to a suitable common format and combined.
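
As a rough sketch of integration, the Python below brings two differently shaped sources into one record format and combines them. Both sources, their field names, and the log line layout are assumptions invented for the example.

```python
# Hypothetical source 1: rows exported from a database table.
db_rows = [
    {"customer_id": 1, "name": "Asha"},
    {"customer_id": 2, "name": "Ravi"},
]

# Hypothetical source 2: lines from a system-generated log file,
# assumed to have the layout "date,customer_id,action".
log_lines = [
    "2024-01-05,1,login",
    "2024-01-06,2,purchase",
    "2024-01-07,1,purchase",
]


def parse_log(lines):
    """Convert raw log lines into dictionaries with typed fields."""
    events = []
    for line in lines:
        date, customer_id, action = line.split(",")
        events.append(
            {"customer_id": int(customer_id), "date": date, "action": action}
        )
    return events


def integrate(customers, events):
    """Attach the customer name from the database to every log event."""
    names = {c["customer_id"]: c["name"] for c in customers}
    return [{**event, "name": names.get(event["customer_id"])} for event in events]


integrated = integrate(db_rows, parse_log(log_lines))
for record in integrated:
    print(record)
```

Each combined record now carries fields from both sources, keyed on the shared customer_id.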

Data selection

             Data relevant to the analysis are retrieved from the databases.
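
A small illustration of selection, using Python's built-in sqlite3 module with an in-memory database. The sales table, its columns, and the choice of the "south" region are hypothetical; the point is only that the query keeps the rows and columns the analysis needs.

```python
import sqlite3

# Build a throwaway in-memory database with a hypothetical sales table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (id INTEGER, region TEXT, amount REAL, notes TEXT)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?, ?)",
    [
        (1, "south", 120.0, "gift"),
        (2, "north", 80.0, ""),
        (3, "south", 45.5, ""),
    ],
)

# Select only the columns and rows relevant to the analysis task.
selected = conn.execute(
    "SELECT id, amount FROM sales WHERE region = ? ORDER BY id",
    ("south",),
).fetchall()
conn.close()

print(selected)
```

The free-text notes column and the rows from other regions are left behind in the database.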

Data transformation

              Data are transformed into forms suitable for mining by summary or aggregation operations.
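
One possible aggregation, sketched in Python: daily sales figures are summarized into monthly totals. The figures and the ISO date format are assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical daily sales as (ISO date, amount) pairs.
daily_sales = [
    ("2024-01-03", 120.0),
    ("2024-01-17", 80.0),
    ("2024-02-02", 45.5),
    ("2024-02-20", 54.5),
]


def monthly_totals(rows):
    """Aggregate daily amounts into one total per month."""
    totals = defaultdict(float)
    for date, amount in rows:
        month = date[:7]  # "YYYY-MM" prefix of the ISO date
        totals[month] += amount
    return dict(totals)


monthly = monthly_totals(daily_sales)
print(monthly)
```

The mining step can then work with two monthly values instead of four daily ones.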

Data mining

               Data mining methods are applied to the prepared data to extract patterns.
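
As one example of such a method, the sketch below counts which pairs of items are bought together often. This is only a simple form of frequent-pattern mining chosen for illustration; the transactions and the minimum support count of 2 are made up.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "jam"},
]


def frequent_pairs(baskets, min_support):
    """Return item pairs that occur together in at least min_support baskets."""
    counts = Counter()
    for basket in baskets:
        # Sorting gives every pair a single canonical order.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


patterns = frequent_pairs(transactions, min_support=2)
print(patterns)
```

With these baskets, bread and milk appear together three times, while bread with butter and butter with milk each appear twice.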

Pattern evaluation

                The truly interesting patterns are identified using interestingness measures.
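
Support and confidence are two commonly used interestingness measures for rules of the form "if A then B". The sketch below computes them for the rule bread -> milk. The baskets and the thresholds (0.5 and 0.7) are assumptions for the example, not values fixed by the KDD process.

```python
# Hypothetical shopping baskets.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "jam"},
]


def count(baskets, items):
    """Number of baskets that contain every item in items."""
    return sum(1 for basket in baskets if items <= basket)


def support(baskets, items):
    """Fraction of all baskets that contain the items."""
    return count(baskets, items) / len(baskets)


def confidence(baskets, antecedent, consequent):
    """Of the baskets containing the antecedent, the fraction that also contain the consequent."""
    return count(baskets, antecedent | consequent) / count(baskets, antecedent)


rule_support = support(transactions, {"bread", "milk"})
rule_confidence = confidence(transactions, {"bread"}, {"milk"})

# Assumed thresholds; a pattern is kept only if it passes both.
is_interesting = rule_support >= 0.5 and rule_confidence >= 0.7
print(rule_support, rule_confidence, is_interesting)
```

Here three of the five baskets contain both items, and three of the four bread baskets also contain milk, so the rule passes both thresholds.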

Knowledge presentation

                 Visualization and knowledge representation techniques are used to present the mined knowledge to users in a form they can easily understand.
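
Even a plain-text chart can make mined patterns easier to read. The sketch below prints hypothetical pattern counts as a small bar chart; in practice a plotting library or a dashboard would usually take this role.

```python
# Hypothetical pattern counts produced by an earlier mining step.
pattern_counts = {"bread + milk": 3, "bread + butter": 2, "butter + milk": 2}


def text_bar_chart(counts):
    """Render counts as aligned text bars, most frequent pattern first."""
    width = max(len(label) for label in counts)
    lines = []
    for label, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        lines.append(f"{label.ljust(width)} | {'#' * n} {n}")
    return "\n".join(lines)


chart = text_bar_chart(pattern_counts)
print(chart)
```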
   
               


              