Photo by Markus Spiske on Unsplash


Improve the train-test split with a hash function

The best way to make sure the training and test sets are never mixed while updating the data set


Recently, I was reading Aurélien Géron’s Hands-On Machine Learning with Scikit-Learn, Keras and TensorFlow (2nd edition) and it made me realize that there might…
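The idea in the title can be illustrated with a minimal sketch. It is not necessarily the code used in the book or in the rest of this article: it assumes every row has a stable, unique identifier, and the names `is_in_test_set` and `split_by_id` are hypothetical. Each ID is hashed to a 32-bit number with `zlib.crc32` from the standard library, and the row goes to the test set when that number falls below `test_ratio * 2**32`.

```python
from zlib import crc32


def is_in_test_set(identifier, test_ratio):
    """Return True if the row with this ID belongs to the test set.

    The ID is hashed to an unsigned 32-bit integer, so the decision
    depends only on the ID itself, not on the rest of the data set.
    """
    hashed = crc32(str(identifier).encode("utf-8"))  # value in [0, 2**32 - 1]
    return hashed < test_ratio * 2**32


def split_by_id(ids, test_ratio=0.2):
    """Split an iterable of IDs into (train_ids, test_ids)."""
    train_ids, test_ids = [], []
    for identifier in ids:
        if is_in_test_set(identifier, test_ratio):
            test_ids.append(identifier)
        else:
            train_ids.append(identifier)
    return train_ids, test_ids


# Initial data set, then an updated one with 500 new rows appended.
train_v1, test_v1 = split_by_id(range(1000))
train_v2, test_v2 = split_by_id(range(1500))

# Rows that were in the training set never move to the test set.
assert set(train_v1).isdisjoint(test_v2)
```

Because the assignment depends only on the ID, appending new rows cannot move an existing row from one set to the other. The test share is only approximately `test_ratio`, since it depends on how the hashed IDs happen to be distributed.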



Data scientist, quantitative finance, gamer. My latest book: Python for Finance Cookbook (2nd ed.)