Monitoring BERT Model Training with TensorBoard

Gradient Flow and Update Ratios

Alexey Kravets
Towards Data Science
6 min readDec 24, 2021

--

https://unsplash.com/@tobiastu

In the previous article, we explained all the building components of the BERT model. Now we are going to train the model monitoring the training process in TensorBoard, looking at the gradient flow, updates-parameters ratios, loss and evaluation metrics.

--

--