Author: Aris Tsakpinis
-
Frugal RLHF with multi-adapter PPO on Amazon SageMaker
32 min read -
Stepping out of the “comfort zone” – part 3/3 of a deep-dive into domain adaptation…
30 min read -
Stepping out of the “comfort zone” – part 2/3 of a deep-dive into domain adaptation…
11 min read -
Stepping out of the “comfort zone” – part 1/3 of a deep-dive into domain adaptation…
15 min read -
Learn how to infuse knowledge into purpose-fine-tuned models while keeping their task-specific nature
14 min read