Why ‘Learn To Forget’ in Recurrent Neural Networks

Illustrated with a simple example

Published in

Towards Data Science

7 min readJan 2, 2021

Consider the following binary classification problem. The input is a binary sequence of arbitrary length. We want the output to be 1 if and only if a 1 occurred in the input but not too recently. Specifically, the last n bits must be 0.

Why ‘Learn To Forget’ in Recurrent Neural Networks

Illustrated with a simple example

Written by Arun Jagota