How to Train an mT5 Model for Translation With Simple Transformers

The mT5 model is pre-trained on over a hundred languages. Let’s see how we can leverage this to train a bilingual translation model for a low-resource language: Sinhalese.

Thilina Rajapakse
Towards Data Science
8 min read · Jan 4, 2021

Photo by Alexandr Podvalny on Unsplash — Hikkaduwa, Sri Lanka
