Learning Transformers Code First: Part 1 — The Setup

A 4-Part Exploration of Transformers Using nanoGPT as a Starting Point

Lily Hughes-Robinson
Towards Data Science
8 min read · Jul 7, 2023


Photo by Josh Riemer on Unsplash

I don’t know about you, but sometimes looking at code is easier than reading papers. When I was working on AdventureGPT, I started by reading the source code of BabyAGI, an implementation of the ReAct paper in around 600 lines of Python.


Lily is a software engineer at a large financial institution. When she isn’t building things with AI, she is hanging out with her wife and toddler in Brooklyn.