The world’s leading publication for data science, AI, and ML professionals.

Weekly Selection – Nov 9, 2018

Why you shouldn't be a data science generalist

By Jeremie Harris – 6 min read

I work at a data science mentorship startup, and I’ve found there’s a single piece of advice that I catch myself giving over and over again to aspiring mentees. And it’s really not what I would have expected it to be.


My secret sauce to be in top 2% of a kaggle competition

By Abhay Pawar – 7 min read

Competing in kaggle competitions is fun and addictive! And over the last couple of years, I developed some standard ways to explore features and build better machine learning models.


5 Bite-Sized Data Science Summaries

By Cassie Kozyrkov – 7 min read

In the spirit of teamwork, the Next Rewind video seriesasked a bunch of people to pick up to five favorite talks from Google Cloud Next SF 2018 and discuss them on camera in no more than five minutes.


Prediction Engineering: How to Set Up Your Machine Learning Problem

By William Koehrsen – 9 min read

These articles will cover the concepts and a full implementation as applied to predicting customer churn. The project Jupyter Notebooks are all available on GitHub.


Scheduling with ease: Cost optimization tutorial for Python

By Eric Stoltz – 6 min read

Congratulations! You’re the proud new owner of the coolest store in town. To keep the operation running, you need to ensure that you have the correct number of workers scheduled for each shift. In this tutorial, we’ll design the lowest cost schedule for the upcoming week.


I was looking for a house, so I built a web scraper in Python!

By Fábio Neves – 8 min read

In a few months I’ll have to leave my rented apartment and look for a new one. As painful as this experience can be, especially as a real estate bubble looms in the horizon, I decided to use it as yet another incentive to improve my Python skills! In the end I want to be able to do two things


Using 3D visualizations to tune hyperparameters in ML models

By Xoel López Barata – 6 min read

Imagine that you’re trying to develop a solution to Kaggle’s Rossmann Store Sales competition. You’ve done a lot of feature engineering and created a ton of new variables that may help you predict future sales better.


Time Series Forecasting with RNNs

By Marek Galovič – 6 min read

In this article I want to give you an overview of a RNN model I built to forecast time series data. Main objectives of this work were to design a model that can not only predict the very next time step but rather generate a sequence of predictions and utilize multiple driving time series together with a set of static (scalar) features as its inputs.


Related Articles