How to Replace Null Values in Spark DataFrames

Discussing how to replace null values in PySpark using fillna() and fill()

Giorgos Myrianthous
Towards Data Science
4 min readJul 19, 2021

--

Introduction

The replacement of null values in PySpark DataFrames is one of the most common operations undertaken. This can be achieved by using either DataFrame.fillna() or DataFrameNaFunctions.fill() methods. In today’s…

--

--

I strive to build data-intensive systems that are not only functional, but also scalable, cost effective and maintainable over the long term.