Big Data From B to A: The Hadoop Distributed Filesystem — HDFS

The guide to understand HDFS concepts

Hajar Khizou
Towards Data Science
5 min readNov 25, 2019

--

Photo by imgix on Unsplash

As data is significantly growing, storing large amounts of information across a network of machines becomes a necessity. Therefore, comes the need for a reliable system, called distributed filesystems, to control how data is stored and retrieved. However, many challenges emerge with the…

--

--