Why ‘Learn To Forget’ in Recurrent Neural Networks
Illustrated with a simple example
7 min read · Jan 2, 2021
Consider the following binary classification problem. The input is a binary sequence of arbitrary length. We want the output to be 1 if and only if a 1 occurred in the input, but not too recently. Specifically, the output is 1 exactly when the sequence contains at least one 1 and its last n bits are all 0.
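To make the target concrete, the labeling rule can be written as a few lines of Python. This is only a sketch of the problem statement: the function name `label` is made up for illustration, and how it treats sequences shorter than n (the whole sequence counts as the "last n bits") is an assumption the problem description does not spell out.

```python
def label(bits, n):
    """Return 1 if `bits` contains a 1 and its last `n` bits are all 0, else 0.

    Assumption: if the sequence is shorter than n, the whole sequence
    is treated as the "last n bits".
    """
    # A 1 must have occurred somewhere in the input.
    has_one = any(b == 1 for b in bits)
    # The most recent n bits must all be 0. n == 0 is handled separately
    # because bits[-0:] would select the whole list rather than nothing.
    tail_clear = True if n == 0 else all(b == 0 for b in bits[-n:])
    return int(has_one and tail_clear)


if __name__ == "__main__":
    n = 3
    print(label([0, 1, 0, 0, 0], n))  # 1: a 1 occurred, last 3 bits are 0
    print(label([0, 1, 0, 0, 1], n))  # 0: a 1 occurred too recently
    print(label([0, 0, 0, 0, 0], n))  # 0: no 1 ever occurred
```

Note that the rule needs two pieces of information at once: whether a 1 has ever been seen, and how long ago the most recent one was.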