Pre Training
This blog post will go line-by-line through the code in Section 3 of Andrej Karpathy’s…
20 min read
Distributed training with multiple GPU nodes
6 min read