Process Dataset with 200 Million Rows using Vaex

Perform operations on a large dataset using a Vaex DataFrame

Satyam Kumar
Towards Data Science
6 min read · Jan 17, 2021


Image by Gerd Altmann from Pixabay

Pandas is one of the most popular libraries for data science case studies, and one of the best tools for exploratory data analysis and data wrangling. It works efficiently with small or medium-sized datasets that fit comfortably in memory. For out of core dataset or…
