From the course: Data Quality: Measure, Improve, and Enforce Reliable Systems
Unlock this course with a free trial
Join today to access over 25,500 courses taught by industry experts.
What is data quality? - Python Tutorial
From the course: Data Quality: Measure, Improve, and Enforce Reliable Systems
What is data quality?
- [Instructor] Data quality issues aren't just technical, they are expensive. According to Gartner, poor data quality costs companies an average of 12.9 to 15 million US dollar per year. During these years working with data, I have always wondered what's the true cost of bad data, and how can we measure, improve, and enforce better quality in our data systems, from pipelines to AI models? That is exactly what this course will help you do. In this video, we will define what data quality really means, why it matters more than you might think, and lay the foundation for how to measure and improve it. But let's begin with understanding what data quality really is. Let's start with a simple definition. In layman's terms, data quality refers to the degree to which the data is accurate, complete, reliable, and fit for its intended purpose. Sounds straightforward, but let's dig deeper. Good data quality means that your data is correct, free from errors, typos, or corrupt values that could…