Data Lake - Comparing Performance of Known Big Data Formats

Performance Comparison of Well-Known Big Data Formats: CSV, JSON, AVRO, PARQUET & ORC

Manoj Kukreja
Towards Data Science
5 min read · Sep 25, 2020

Photo by Mika Baumeister on Unsplash

For the past several years, I have used all kinds of data formats in Big Data projects. During this time I have strongly favored one format over the others, and my failures have taught me a few lessons. During my lectures I…


Author, Big Data Engineering, Data Science, Data Lakes, Cloud Computing and IT security specialist.