Using Python’s datatable library seamlessly on Kaggle

Managing large datasets on Kaggle without fearing about the out of memory error

Parul Pandey
Towards Data Science
5 min readFeb 4, 2021

--

Image by Author

Datatable is a Python package for manipulating large dataframes. It has been created to provide big data support and enable high performance. This toolkit resembles pandas very closely but is more focused on speed. It supports out-of-memory…

--

--

Principal Data Scientist @H2O.ai | Author of Machine Learning for High-Risk Applications