Stratified sampling and how to perform it in R

The proper way to sample a huge dataset

Gianluca Malato
Towards Data Science
5 min readMay 7, 2019

--

Photo by Giorgio Tomassetti on Unsplash

In a previous article, I’ve written about the importance of selecting a sample from a population in a proper way. Today I’ll show you a technique called stratified sampling, which can help us create a statistically significant sample from a huge dataset.

The correct way to sample a…

--

--

Theoretical Physicists, Data Scientist and fiction author. I teach Data Science, statistics and SQL on YourDataTeacher.com. E-mail: gianluca@gianlucamalato.it