Imbalanced Classes: Predicting Hotel Cancellations with Support Vector Machines
When attempting to build a classification algorithm, one must often contend with the issue of an unbalanced dataset.
6 min read · Feb 12, 2020
Note: The original article is available here, with a link to the relevant GitHub repository and code for this example.
An unbalanced dataset is one in which the classes have unequal sample sizes, which introduces significant bias into the predictions of the classifier in…
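To make the effect of unequal class sizes concrete, here is a minimal sketch in plain Python. The labels and the 90/10 split are made up for illustration and are not taken from the hotel data. The sketch shows that a model which always predicts the majority class can look accurate while never identifying the minority class. The weighting formula at the end is one common heuristic (it matches what scikit-learn's `class_weight='balanced'` option computes) and is shown here as an assumption about how one might counter the bias, not as the method used in this article.

```python
from collections import Counter

# Hypothetical labels: 1 = cancelled, 0 = not cancelled (made-up 90/10 split).
labels = [0] * 90 + [1] * 10

counts = Counter(labels)
majority_class = counts.most_common(1)[0][0]

# A "classifier" that ignores its input and always predicts the majority class.
predictions = [majority_class] * len(labels)

# Overall accuracy: share of predictions that match the true label.
accuracy = sum(p == y for p, y in zip(predictions, labels)) / len(labels)

# Recall on the minority class: share of true 1s that were predicted as 1.
true_positives = sum(p == 1 and y == 1 for p, y in zip(predictions, labels))
minority_recall = true_positives / counts[1]

# "Balanced" weights: n_samples / (n_classes * class_count),
# so the rarer class receives a proportionally larger weight.
class_weights = {c: len(labels) / (len(counts) * n) for c, n in counts.items()}

print(f"accuracy={accuracy:.2f}, minority recall={minority_recall:.2f}")
print(class_weights)
```

On this toy split the accuracy is 0.90 even though the recall on the minority class is 0.00, which is why accuracy alone can be misleading on unbalanced data.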