Data cleaning is the process of fixing or removing incorrect, corrupted, incorrectly formatted, duplicate, or incomplete data within a dataset. When multiple data sources are combined, there are many opportunities for data to be duplicated or mislabeled. If data is not cleaned correctly, any results or conclusions drawn from it can be wrong or misleading.
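As a rough illustration of these ideas, the sketch below cleans a small list of records merged from two sources. The record layout (`name` and `label` fields), the sample rows, and the helper `clean_records` are all hypothetical, chosen only for this example. It normalizes formatting, drops incomplete rows, and removes duplicates that only become visible after normalization. Real pipelines usually need rules specific to the dataset.

```python
def clean_records(records):
    """Return a cleaned copy of `records` (a list of dicts with 'name' and 'label').

    Hypothetical helper for illustration only.
    """
    cleaned = []
    seen = set()
    for rec in records:
        name = rec.get("name")
        label = rec.get("label")

        # Incomplete data: skip records with missing or non-text fields.
        if not isinstance(name, str) or not isinstance(label, str):
            continue

        # Incorrect formatting: trim whitespace and normalize label case.
        name = name.strip()
        label = label.strip().lower()
        if not name or not label:
            continue

        # Duplicates: compare on a case-insensitive key after normalization.
        key = (name.lower(), label)
        if key in seen:
            continue
        seen.add(key)
        cleaned.append({"name": name, "label": label})
    return cleaned


# Hypothetical rows combined from two sources (assumed data, not from a real dataset).
source_a = [{"name": "Alice", "label": "Customer"}, {"name": "Bob", "label": "vendor"}]
source_b = [{"name": " alice ", "label": "customer"}, {"name": "Carol", "label": None}]

cleaned = clean_records(source_a + source_b)
print(cleaned)
```

Here the second "alice" row is treated as a duplicate of the first once whitespace and case are normalized, and the "Carol" row is dropped because its label is missing. Whether to drop or repair such rows is a judgment call that depends on the dataset.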