Data cleansing or data cleaning is the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database and refers to identifying incomplete, incorrect, inaccurate or irrelevant parts of the data and then replacing, modifying, or deleting the dirty or coarse data. Data cleansing may be performed interactively with data wrangling tools, or as batch processing through scripting or a data quality firewall.
References
-
What is Data Cleansing how and why businesses use Data Cleansing, and how to use Data Cleansing with AWS.🔗Amazon Web Services, Inc.
-
-
Data cleansing, also known as data cleaning or scrubbing, identifies and fixes errors, duplicates, and irrelevant data from a raw dataset.🔗Alteryx
-
Data cleansing or scrubbing is the process of fixing errors and other issues in data sets. Learn about the data cleansing process and its business benefits.🔗Data Management