Photo Credit: tian kuan

Distributed Deep Learning Pipelines with PySpark and Keras

An easy approach to data pipelining using PySpark and doing distributed deep learning with Keras

Andre Violante
Towards Data Science
11 min readJun 20, 2019

--

Introduction

In this notebook I use PySpark, Keras, and Elephas python libraries to build an end-to-end deep learning pipeline that runs on Spark. Spark is an open-source…

--

--

Lifelong learner! Doing data science, teaching, and startups. Family and friends first!