5 Perspectives on Why Dropout Works So Well

In 5 minutes

Andre Ye
Towards Data Science
5 min read · Aug 8, 2020


Dropout works by randomly blocking off a fraction of the neurons in a layer during training: at each training step, a fresh random subset of the layer's outputs is set to zero. Then, during prediction (after training), Dropout does not block any neurons. The practice has been enormously successful: competition-winning networks very often make Dropout an essential part of the architecture.
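To make the train-versus-predict difference concrete, here is a minimal NumPy sketch. The function name `dropout` and its parameters are hypothetical, chosen for illustration. It also assumes the common "inverted dropout" convention, which the description above does not spell out: the surviving activations are scaled up by `1 / (1 - rate)` during training so that the expected output matches what the layer produces at prediction time.

```python
import numpy as np

def dropout(activations, rate, training, rng):
    """Illustrative dropout; `rate` is the fraction of neurons to block."""
    if not 0.0 <= rate < 1.0:
        raise ValueError("rate must be in [0, 1)")
    # During prediction, nothing is blocked: pass activations through.
    if not training or rate == 0.0:
        return activations
    keep_prob = 1.0 - rate
    # Each neuron is kept independently with probability keep_prob.
    mask = rng.random(activations.shape) < keep_prob
    # Assumption: inverted-dropout scaling keeps the expected value unchanged.
    return activations * mask / keep_prob

rng = np.random.default_rng(0)
layer_output = np.ones((4, 5))

# Training: some entries become 0, the rest are scaled to 1 / 0.6.
train_out = dropout(layer_output, rate=0.4, training=True, rng=rng)
# Prediction: the output equals the input.
test_out = dropout(layer_output, rate=0.4, training=False, rng=rng)
```

Because the mask is redrawn on every call, each training step effectively sees a different thinned version of the layer, while prediction always uses the full layer.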

