Automated Exploratory Data Analysis using Python

When working on different projects it used to become difficult for me to do Exploratory Data Analysis as it used to involve so many steps.So, I started using automatic EDAs which used to save a lot of time for me.

Please find below the EDAs that I generally use in my daily work.I hope it will help you in saving your time.

1. PANDAS PROFILING

We can install it using the command given below.

pip install pandas-profiling

We can import it using the command given below

from pandas_profiling import ProfileReport

We can execute it using the command given below

profile = ProfileReport(df)
profile

After execution , the report we look like

No alt text provided for this image

It has three main sections:

1.Overview-As the name suggest we get all the overview of the dataset in it like number of variables, variable types etc.

2.Warnings-It shows all the warning such like high cardinality ,missing, uniform and unique.

No alt text provided for this image

3.Variables:It shows us all the information related to each column in our dataset.

No alt text provided for this image

We can also get other informations like correlations.missing values count etc.

No alt text provided for this image

2. DATA PREP

We can install it using the command given below.

pip install dataprep

We can import it using the command given below

from dataprep.eda import plot

We can execute it using the command given below

plot(df)

After execution , the report we look like

No alt text provided for this image

Each graph is intractive which means we can get count, percntage of each data in the column

No alt text provided for this image

Thank you for your time and do let me know if you know any other automatic EDA libraries for python.

To view or add a comment, sign in

Others also viewed

Explore content categories