Exploratory Data Analysis (EDA)

perform_eda(data, provider='pandas-profiling', sample_size=10000, data_randomizer=2)[source]

Performs Exploratory Data Analysis (EDA)

data: pandas dataframe

Dataframe for exploratory data analysis

provider{‘pandas-profiling’, ‘sweetviz’, ‘dtale’}, default=’pandas-profiling’

Library provider for exploratory data analysis

sample_size: str, default=10000

Number of rows to return from dataframe. None to perform eda on the complete dataset which can be slower if dataset has large number of rows and columns

data_randomizer: int, default=None

Controls the data split. Provide a value to reproduce the same split.

Examples

EDA using Pandas Profiling

eda_pandas-profiling

EDA using SweetVIZ

eda_sweetviz

EDA using D-TALE

eda_dtale