Data mining basics and kdd process
first why we need data mining?
Now a days lot of data are created and collected regularly. Analysing such data is very important. So that we get valuable information. Thus the purpose of data mining is to turn the data into knowledge.what is data mining?
Data mining is the process of discovering interesting patterns and knowledge from large amount of data.databases,data warehouses,web,internet,social medias,system generated data such as log file are the sources of data.Knowledge mining from data,knowledge extraction,data or pattern analysis,data archaeology,data dredging are the similar words to the data mining,knowledge discovery from data. The process consists of following steps
Data cleaning
Data integration
Data selection
Data transformation
Data mining
Pattern evaluation
Knowledge presentation
Data cleaning
System generated data or data stored in the database has some unnecessary data and errors.in the starting phase we have to remove those noise. So that the results are accurateData integration
Data mining has to be done in various type of resources. So all the data sources are converted to the suitable format and combined.Data selection
Data suitable for the analysis are retrieved from the databases.Data transformation
Data are transformed into suitable forms by summary or aggregation operation.Data mining
Patterns are extracted by applying methods.Pattern evaluation
Identifying the interesting patterns according to the interestingness measures.Knowledge presentation
Visualization and knowledge representation are used for effective understanding of users.x

Comments
Post a Comment