Transcription of Data Cleaning: Problems and Current Approaches
{{id}} {{{paragraph}}}
Data Cleaning: Problems and Current ApproachesErhard Rahm Hong Hai DoUniversity of Leipzig, classify data quality Problems that are addressed by data cleaning and provide an overview of the mainsolution Approaches . Data cleaning is especially required when integrating heterogeneous data sources andshould be addressed together with schema-related data transformations. In data warehouses, data cleaning isa major part of the so-called ETL process. We also discuss Current tool support for data IntroductionData cleaning, also called data cleansing or scrubbing, deals with detecting and removing errors andinconsistencies from data in order to improve the quality of data.
2 data inconsistencies and the sheer data volume, data cleaning is considered to be one of the biggest problems in data warehousing. During the so-called ETL process (extraction, transformation, loading), illustrated in
Domain:
Source:
Link to this page:
Please notify us if you found a problem with this document:
{{id}} {{{paragraph}}}