By Jeremie Harris – 6 min read
I work at a data science mentorship startup, and I’ve found there’s a single piece of advice that I catch myself giving over and over again to aspiring mentees. And it’s really not what I would have expected it to be.
My secret sauce to be in top 2% of a kaggle competition
By Abhay Pawar – 7 min read
Competing in kaggle competitions is fun and addictive! And over the last couple of years, I developed some standard ways to explore features and build better machine learning models.
5 Bite-Sized Data Science Summaries
By Cassie Kozyrkov – 7 min read
In the spirit of teamwork, the Next Rewind video seriesasked a bunch of people to pick up to five favorite talks from Google Cloud Next SF 2018 and discuss them on camera in no more than five minutes.
Prediction Engineering: How to Set Up Your Machine Learning Problem
By William Koehrsen – 9 min read
These articles will cover the concepts and a full implementation as applied to predicting customer churn. The project Jupyter Notebooks are all available on GitHub.
Scheduling with ease: Cost optimization tutorial for Python
By Eric Stoltz – 6 min read
Congratulations! You’re the proud new owner of the coolest store in town. To keep the operation running, you need to ensure that you have the correct number of workers scheduled for each shift. In this tutorial, we’ll design the lowest cost schedule for the upcoming week.
I was looking for a house, so I built a web scraper in Python!
By Fábio Neves – 8 min read
In a few months I’ll have to leave my rented apartment and look for a new one. As painful as this experience can be, especially as a real estate bubble looms in the horizon, I decided to use it as yet another incentive to improve my Python skills! In the end I want to be able to do two things
Using 3D visualizations to tune hyperparameters in ML models
By Xoel López Barata – 6 min read
Imagine that you’re trying to develop a solution to Kaggle’s Rossmann Store Sales competition. You’ve done a lot of feature engineering and created a ton of new variables that may help you predict future sales better.
Time Series Forecasting with RNNs
By Marek Galovič – 6 min read
In this article I want to give you an overview of a RNN model I built to forecast time series data. Main objectives of this work were to design a model that can not only predict the very next time step but rather generate a sequence of predictions and utilize multiple driving time series together with a set of static (scalar) features as its inputs.