Spark MLlib Python Example — Machine Learning At Scale

Cory Maklin
Towards Data Science
9 min readJun 30, 2019

--

For most of their history, computer processors became faster every year. Unfortunately, this trend in hardware stopped around 2005. Due to limits in heat dissipation, hardware developers stopped increasing the clock frequency of individual processors and opted for parallel CPU cores. This is fine for playing video games on a desktop computer. However, when it involves processing petabytes of…

--

--