Predicting Hazardous Seismic Bumps Part I : EDA, Feature Engineering & Train Test Split for Unbalanced Dataset

This article demonstrates exploratory data analysis (EDA), feature engineering, and splitting strategies for unbalanced data using the seismic bumps dataset from the UCI Data Archive.

Nabanita Roy
Towards Data Science
10 min readAug 6, 2020

--

Photo by Dominik Vanyi on Unsplash

--

--

Data Scientist @ EY (UK & Ireland) | Education Lead @ Women in AI Ireland | ❤ NLP