From the course: Data Analysis with Python and Pandas
Unlock this course with a free trial
Join today to access over 25,500 courses taught by industry experts.
Aggregation and resampling
From the course: Data Analysis with Python and Pandas
Aggregation and resampling
- [Instructor] All right, so let's take a look at time series aggregation. We can aggregate time series using the groupby method, but depending on our use case, we're going to show you a better method. But let's take a look at this with the groupby method first. So here we have our sales series. We have a bunch of dates and we have a bunch of sales as our values. And here we're accessing the month portion of our date to get a sum of sales by month. And note that this is going to calculate the total sales by month regardless of the year. So this is something we need to be careful about with traditional aggregation. We're not going to be grouping by calendar month. We're going to be grouping by any year that has a specific month. So we're grabbing values from January, 2022 and January, 2023 when we group like this. So in order to make sure we keep our year distinct, we need to specify grouping by both year and month. We could also group by year, which would just calculate the total…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
-
-
-
-
(Locked)
Times in Python and pandas3m 8s
-
(Locked)
Converting to datetimes6m 16s
-
(Locked)
Formatting dates5m 20s
-
(Locked)
Date and time parts3m 4s
-
(Locked)
Challenge: pandas datetime basics1m 23s
-
(Locked)
Solution: pandas datetime basics2m 10s
-
(Locked)
Time deltas and arithmetic6m 54s
-
(Locked)
Challenge: Time deltas1m 10s
-
(Locked)
Solution: Time deltas1m 29s
-
(Locked)
Time series indices3m 58s
-
(Locked)
Missing time series data4m 45s
-
(Locked)
Challenge: Missing time series data1m 44s
-
(Locked)
Solution: Missing time series data2m 13s
-
(Locked)
Shifting time series3m 16s
-
(Locked)
Pro tip: diff()2m 54s
-
(Locked)
Challenge: shift() and diff()1m 39s
-
(Locked)
Solution: shift() and diff()2m 47s
-
(Locked)
Aggregation and resampling4m 6s
-
(Locked)
Challenge: Resampling41s
-
(Locked)
Solution: Resampling1m 53s
-
(Locked)
Rolling aggregations4m 35s
-
(Locked)
Challenge: Rolling aggregations45s
-
(Locked)
Solution: Rolling aggregations55s
-
(Locked)
Key takeaways1m 37s
-
(Locked)
-
-
-