From the course: Data Analysis with Python and Pandas
Unlock this course with a free trial
Join today to access over 25,500 courses taught by industry experts.
Solution: Missing data
From the course: Data Analysis with Python and Pandas
Solution: Missing data
- [Instructor] All right, everybody. Our solution code is up on the right. Let's go ahead and dive into the notebook. All right, so once again, we have our oil series. We've already filled in the missing values, so we just need to recreate this series. And then our first task was to count the number of missing values. So to do this, we just need to do oil_series.isna. Remember, this will return a Boolean series indicating whether or not the value is missing. So, dot sum or count or missing values, we just need to do is an a.sum that will return the number of missing values, which was two in this dataset. To fill in the missing values here, we want to use the fill in a method. And so now that we have our missing count, we just need to fill in the missing values with the median of this series. So we'll use a "fillna" method. So we use a "fillna" method, and we'll pass an oil_series.median, and this will fill in the missing values in this series with the median of the oil price series…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
(Locked)
Series basics10m
-
(Locked)
pandas data types and type conversion6m 46s
-
(Locked)
Challenge: Data types and type conversion2m 23s
-
(Locked)
Solution: Data types and type conversion3m 5s
-
(Locked)
The series index and custom indices7m 6s
-
(Locked)
The .iloc accessor4m 33s
-
(Locked)
The .loc accessor7m 3s
-
(Locked)
Duplicate index values and resetting the index6m 33s
-
(Locked)
Challenge: Accessing data and resetting the index2m 1s
-
(Locked)
Solution: Accessing data and resetting the index2m 39s
-
(Locked)
Filtering series and logical tests8m 19s
-
(Locked)
Sorting series3m 45s
-
(Locked)
Challenge: Sorting and filtering series57s
-
(Locked)
Solution: Sorting and filtering series3m 24s
-
(Locked)
Numeric series operations6m 31s
-
(Locked)
Text series operations7m 4s
-
(Locked)
Challenge: Series operations1m 36s
-
(Locked)
Solution: Series operations3m 53s
-
(Locked)
Numerical series aggregation5m 43s
-
(Locked)
Categorical series aggregation3m 32s
-
(Locked)
Challenge: Series aggregation50s
-
(Locked)
Solution: Series aggregation4m 20s
-
(Locked)
Missing data representation in pandas4m 29s
-
(Locked)
Identifying missing data2m 15s
-
(Locked)
Fixing missing data9m 27s
-
(Locked)
Challenge: Missing data45s
-
(Locked)
Solution: Missing data1m 35s
-
(Locked)
Applying custom functions to series4m 6s
-
(Locked)
pandas where() vs. NumPy where()6m 3s
-
(Locked)
Challenge: apply() and where()1m 9s
-
(Locked)
Solution: apply() and where()4m 37s
-
(Locked)
Key takeaways1m 24s
-
(Locked)
-
-
-
-
-
-
-
-