Pandas Profiling: Quick way to analyze the data.
Pandas Profiling is a python library that can help in a quicker understanding of the data. Using this library we will create an HTML page that will give a summary of the data frame, the relation among the data available, and much more.
Let's try to bring data to a data frame and after that, we will apply pandas profiling to it.
Once we have the data frame with us, let's install Pandas profiling.
I have specified the version because without the version it was not working for me. Once we have the library installed, we can call it and ask the library to create an HTML file.
After the execution, we have the HTML file with us which contains all the information that we can need to understand the data.
Above is the first page of the html file, which gives very brief info about the fields of the data frame.
The first tab gives an overview of the total data. It shows how many fields we have and if there are any missing values. If there are correlations between any two fields.
This can help to understand if any field has repeating values or missing values.
We can change the fields in the interaction and tab and check the relation among the fields.
Using this library, we can save time used for analyzing the data. This will give all the data in a graph and a proper description.