From the course: Data Analysis with Python and Pandas

Unlock this course with a free trial

Join today to access over 25,500 courses taught by industry experts.

Mid-course project intro

Mid-course project intro

- [Instructor] Okay, everybody. We're now able to perform a lot of different types of aggregations and data manipulation on our Pandas data structures, like Series and DataFrames. This is a good opportunity to take a step back from learning new material and test our understanding of the material covered with a midcourse project. We'll start by looking at our Transactions dataset. No, this is not the same Transactions dataset we've been working with in our assignments. We have a dataset of transactions made by various households for the retailer we want to acquire. We have household_key, which is an ID variable representing the household that made purchases. We have BASKET_ID, which represents a given transaction. For example, if you went to the store and bought five items, these five rows would represent each item you bought in your trip to the store. We then have the DAY that the purchase was made. We have the PRODUCT_ID of the product for each line item. We have the QUANTITY for…

Contents