Parquet Best Practices: Discover your Data without loading it

Metadata, Statistics on Row Groups, Partitions discovery, and Repartitioning

Arli
Towards Data Science
8 min readJan 3, 2023

--

If you like to experience Medium yourself, consider supporting me and thousands of other writers by signing up for a membership. It only costs $5 per month, it supports us, writers, greatly, and you get to access all the amazing stories on Medium.

Photo by Jakarta Parquet on Unsplash

--

--

Data Engineer working in the Financial Industry. I write about topics related to Data Engineering, Data Science and Finance.