Gpt
Scaling from 117M to 175B: Insights into GPT-2 and GPT-3.
10 min read -
Understanding the Evolution of ChatGPT: Part 1-An In-Depth Look at GPT-1 and What Inspired It
Deep Learning
Tracing the roots of ChatGPT: GPT-1, the foundation of OpenAI’s LLMs
11 min read -
Understanding and implementing the GPT-1, GPT-2 and GPT-3 architectures
31 min read -
In this companion article, I’ll show my implementation for training from scratch a GPT-like model,…
16 min read -
Automating Scientific Code Documentation: A GPT-Powered POC for Streamlined Workflows
13 min read -
What exactly do you put in, what exactly do you get out, and how do…
17 min read -
An engineer’s journey to building LLM-native applications
9 min read -
Prompting GPT to form and solve the linear equations using PuLP
11 min read -
Existence of under-trained and unused tokens and Identification Techniques using GPT-2 Small as an Example
8 min read