EDUCATION & TRAINING
Building Modern EDA Pipelines with Pingouin
KDnuggets
About This Tutorial
Learn how to build a holistic pipeline for rigorous, statistical EDA, validating several important data properties. Anyone who has spent a fair amount of time doing data science may sooner or later learn something: the golden rule of downstream machine learning modeling, known as garbage in, garbage out (GIGO). As an example open dataset, we will use one containing samples of wine properties and their quality.