Author: Tula Masterman
-
DeepSeek-R1, OpenAI o1 & o3, Test-Time Compute Scaling, Model Post-Training and the Transition to Reasoning…
11 min read -
A novel approach for lightweight safety classification using pruned language models
13 min read -
Exploring the future of multimodal AI Agents and the Impact of Screen Interaction
8 min read -
Unpacking problem solving and tool-driven decision making in AI
13 min read -
A comprehensive guide to new models GPT4o-mini, Llama 3.1, Mistral NeMo 12B and other GenAI…
8 min read -
Dive into model pre-training, fine-tuning, RAG, prompt engineering, and more!
19 min read -
Evaluating the evolution and application of language models on real world tasks
8 min read