PDF4PRO ⚡AMP

Modern search engine that looking for books and documents around the web

Example: marketing

Data cleaning and Data preprocessing

Back to document page

Preprocessing1Data cleaning andData preprocessingNguyen Hung SonThis presentation was prepared on the basis of the following public Han and Micheline Kamber, data mining, concept and techniques Piatetsky-Shapiro, kdnuggest , 2Outline Introduction data cleaning data integration and transformation data reduction Discretization and concept hierarchy generation Summarypreprocessing 3Why data preprocessing ? data in the real world is dirty incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate data noisy: containing errors or outliers inconsistent: containing discrepancies in codes or names No quality data , no quality mining results! Quality decisions must be based on quality data data warehouse needs consistent integration of quality datapreprocessing 4Data Understanding: Relevance What data is available for the task?

Fill in missing values, smooth noisy data, identify or remove outliers, and ... Imputation: Use the attribute mean to fill in the missing value, or use the attribute mean for all samples belonging to the same class to fill in the missing value: smarter ... Clustering detect and remove ...

  Data, Value, Cleaning, Missing, Clustering, Preprocessing, Imputation, Missing values, Data cleaning and data preprocessing

Download Data cleaning and Data preprocessing


Information

Domain:

Source:

Link to this page:

Please notify us if you found a problem with this document:

Spam in document Broken preview Other abuse

Related search queries