Pyspark – Import any data

A brief guide to import data with Spark

Alexandre Warembourg
Towards Data Science
3 min readOct 10, 2019

--

https://upload.wikimedia.org/wikipedia/commons/f/f3/Apache_Spark_logo.svg

With this article, I will start a series of short tutorials on Pyspark, from data pre-processing to modeling. The first will deal with the import and export of any type of data, CSV , text file, Avro, Json …etc. I work on a virtual machine on google cloud platform data comes from a bucket on cloud storage. Let’s import them.

--

--