EDUCATION & TRAINING

Building Modern EDA Pipelines with Pingouin

KDnuggets

About This Tutorial

Learn how to build a holistic pipeline for rigorous, statistical EDA, validating several important data properties. Anyone who has spent a fair amount of time doing data science may sooner or later learn something: the golden rule of downstream machine learning modeling, known as garbage in, garbage out (GIGO). As an example open dataset, we will use one containing samples of wine properties and their quality.