Understanding Spark As If You Had Designed It

From a simple function to a resilient and distributed framework.

Felipe Melo
Towards Data Science
12 min read · Jun 9, 2020

Why care about Spark?

Among the frameworks currently available in the data space, only a few have achieved the status that Spark has in terms of adoption and delivery. The framework has emerged as one of the clear winners, especially on the Data Engineering side of the landscape.

If you are reading this article, it means that you already understand the reasons behind the previous paragraph, so we…
