Understanding the Open Pre-Trained Transformers (OPT) Library

Towards transparency and inclusion in language modeling…

Cameron R. Wolfe, Ph.D.
Towards Data Science
15 min read · Jun 6, 2022


Depiction of a decoder-only language modeling architecture (created by author)

Recently, Meta AI published “OPT: Open Pre-Trained Transformer Language Models” [1] along with an associated code repository, with the intent of open-sourcing high-performing large language models (LLMs) to the public. In particular, OPT provides an entire suite of LLMs…
