From the course: Data Preparation, Feature Engineering, and Augmentation for AI Models

Unlock this course with a free trial

Join today to access over 25,500 courses taught by industry experts.

Data exploration and initial quality assessment

Data exploration and initial quality assessment

- [Instructor] Data exploration is a critical first step before any modeling or analysis should happen. And the reason we do this is because data exploration helps us reveal hidden patterns, relationships, and potentially anomalies in our datasets. Now we use data exploration to identify data quality issues that could undermine our AI model's performance. But it's also useful because data exploration can inform both feature engineering and feature selection decisions. And then the overall benefit of data exploration is that it really helps us reduce project risks. And it does this by exposing problems early in the development cycle. Now some common data quality issues to watch out for are missing data. Now we see this in the forms of incomplete records, for example, records that have empty fields or null values. We can also have issues with inconsistent formats. So for example, we might have dates in different formats. So some dates might be in year month day format while others are…

Contents