Spark SQL 102 — Aggregations and Window Functions

Analytical functions in Spark for beginners.

David Vrba
Towards Data Science
7 min read · Jun 30, 2021

Data aggregation is an important step in many data analyses. It is a way to reduce a dataset and compute various metrics, statistics, and other characteristics. A related but slightly more advanced topic is window functions, which also allow computing other analytical and ranking functions on the data…

Senior ML Engineer at Socialbakers and Apache Spark trainer and consultant. I lead Spark trainings and workshops and give public talks related to Spark.