PEX — The secret sauce for the perfect PySpark deployment of AWS EMR workloads

How to use PEX to speed up deployment of PySpark applications on ephemeral AWS EMR clusters

Jan Teichmann
Towards Data Science
8 min readApr 23, 2020

--

[OC]

In the big data and data science world Spark has become a gold standard for almost anything else than deep learning:

  • ELT for data lakes replacing the more…

--

--