👉 Tip #41: Profile your pandas DataFrame in one line of code

Say you’ve got a new dataset and want to explore it quickly, with minimal effort. Here’s what to do:

Step 1: Install ydata-profiling

Step 2: Import ydata_profiling in Jupyter

Step 3: Create a ProfileReport() object and pass it any DataFrame
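
Here's a minimal sketch of the three steps. The example DataFrame, the helper names make_example_frame and build_report, and the report title are all made up for illustration; only ProfileReport comes from the library itself.

```python
# Step 1 (run once, in a terminal or in a notebook cell prefixed with "!"):
#   pip install ydata-profiling

import pandas as pd


def make_example_frame() -> pd.DataFrame:
    """Build a small, made-up DataFrame so the example is self-contained."""
    return pd.DataFrame(
        {
            "age": [23, 35, None, 41, 29],
            "city": ["Lyon", "Oslo", "Lyon", None, "Kyiv"],
            "income": [32000.0, 58000.0, 41000.0, 77000.0, 39000.0],
        }
    )


def build_report(df: pd.DataFrame, title: str = "Dataset profile"):
    """Steps 2 and 3: import the library and wrap the DataFrame in a report."""
    # Step 2: the import lives here so the rest of the file loads
    # even where ydata-profiling is not installed.
    from ydata_profiling import ProfileReport

    # Step 3: the one line that does the profiling.
    return ProfileReport(df, title=title)
```

In a notebook, report = build_report(make_example_frame()) followed by report.to_notebook_iframe() should show the report inline, while report.to_file("profile_report.html") saves it as a standalone HTML file instead.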

Voilà! It returns an interactive report in HTML format.

The report’s first section is an overview of the dataset and a list of possible issues with the data.

The second section gives a summary of each column.

The third section lets you explore feature interactions.

The fourth section visualizes missing values.

And the final section shows a sample of the dataset.
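
To get a feel for what those five sections summarize, here's a rough pandas-only sketch. It is not how ydata-profiling computes its report, just a hand-rolled approximation; the quick_profile helper and the sample data are hypothetical.

```python
import pandas as pd


def quick_profile(df: pd.DataFrame, sample_rows: int = 5) -> dict:
    """Collect rough, manual stand-ins for the report's five sections."""
    numeric = df.select_dtypes(include="number")
    return {
        # Overview: dataset size and number of fully duplicated rows.
        "overview": {
            "rows": len(df),
            "columns": df.shape[1],
            "duplicate_rows": int(df.duplicated().sum()),
        },
        # Per-column summary: counts, unique values, and basic statistics.
        "columns": df.describe(include="all").T,
        # Interactions: pairwise correlation between numeric columns.
        "correlations": numeric.corr(),
        # Missing values: number of nulls per column.
        "missing": df.isna().sum(),
        # Sample: the first few rows.
        "sample": df.head(sample_rows),
    }


# Made-up data, for illustration only.
df = pd.DataFrame(
    {
        "age": [23, 35, None, 41, 29],
        "city": ["Lyon", "Oslo", "Lyon", None, "Kyiv"],
        "income": [32000.0, 58000.0, 41000.0, 77000.0, 39000.0],
    }
)
profile = quick_profile(df)
print(profile["overview"])
print(profile["missing"])
```

The real report adds plots, warnings, and interactivity on top of numbers like these, which is the whole point of letting the library do it in one line.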

