
Weekly Selection – Dec 18, 2020

Increase Productivity: Data Cleaning using Python and Pandas

Photo by Galina N on Unsplash

Data cleaning can be time-consuming, but understanding the different types of missing values, and how to deal with them, will significantly increase your productivity.

By Renato Boemer – 6 min read


Causal Inference in Data Science: G-Estimation of Structural Nested Models

Mathematical Derivation and Computational Simulation of the semi-parametric class of G-methods

By Andrew Rothman – 14 min read


Classification, regression, and prediction – what’s the difference?

Know your species of Machine Learning task

By Cassie Kozyrkov – 5 min read


Forgetting in Deep Learning

A study of techniques related to catastrophic forgetting in deep neural networks.

By Mingyue Wei, Qiang Fei, Yingsi Jian, Shuyuan Xiao – 11 min read


Hidden Markov Model for Biological Sequence

This article presents a short introduction to Markov chains and hidden Markov models, with an emphasis on their application to bio-sequences.

By Mattia Cinelli – 9 min read


A Closer Look at the Biomass Burned in California’s Wildfires

We compared tree species and biomass data from UC Santa Cruz’s ForestGEO site with wildfires like the one just northwest of Santa Cruz proper, and here’s what we found.

By Lauren Low – 7 min read


The Travelling Apothecary

A discrete optimization approach to Skyrim’s alchemy system.

By Nathan Pratt – 12 min read


Unwrapping the Swiss Roll with Diffusion Maps

The amazing dimensionality reduction algorithm you may have never heard of

By Sebastian Dick – 11 min read

