Issue Identification

5.2. Issue Identification#

Once we have previewed our data, we can start identifying various issues in it. These issues can range from missing data and inconsistent categories to non-standard values and duplicate records.

We can only clean or fix our datasets after we become aware of these problems. This step acts as a precursor to the next chapter on Data Wrangling, where you will learn how to address issues such as incorrect data types, inconsistent formats, and other data quality challenges.

We will go over some of the commonly occuring issues and how we can identify them.