Going Global —How to Multi-Task in Multiple Languages with the mT5 Transformer

Cross-lingual, zero-shot training with mT5 — Training an mT5 model in English and using it with other languages!

Thilina Rajapakse
Towards Data Science
10 min readDec 16, 2020

--

Photo by Bruno Wolff on Unsplash

The original T5 (Text-To-Text Transfer Transformer) model achieved state-of-the-art performance on a variety of NLP benchmarks by leveraging a…

--

--