Data cleaning can be time-consuming, but understanding the different types of missing values, and how to deal with them, will significantly increase your productivity.
By Renato Boemer – 6 min read
Causal Inference in Data Science: G-Estimation of Structural Nested Models
Mathematical Derivation and Computational Simulation of the semi-parametric class of G-methods
By Andrew Rothman – 14 min read
Classification, regression, and prediction – what’s the difference?
Know your species of Machine Learning task
By Cassie Kozyrkov – 5 min read
Forgetting in Deep Learning
A study of techniques related to catastrophic forgetting in deep neural networks.
By Mingyue Wei, Qiang Fei, Yingsi Jian, Shuyuan Xiao – 11 min read
Hidden Markov Model for Biological Sequence
This article presents a short introduction to Markov chains and Hidden Markov Models, with an emphasis on their application to bio-sequences.
By Mattia Cinelli – 9 min read
A Closer Look at the Biomass Burned in California’s Wildfires
We compared tree species and biomass data from UC Santa Cruz’s ForestGEO site to wildfires like the one just northwest of Santa Cruz proper, and here’s what we found.
By Lauren Low – 7 min read
The Travelling Apothecary
A discrete optimization approach to Skyrim’s alchemy system.
By Nathan Pratt – 12 min read
Unwrapping the Swiss Roll with Diffusion Maps
The amazing dimensionality reduction algorithm you may have never heard of
By Sebastian Dick – 11 min read