Author: Roman S
-
Check and improve classifier-free guidance for text generation large language models. While participating in NeurIPS…
14 min read -
LLM unlearning without model degradation is achieved through direct training on the replacement data and…
7 min read -
Abstract: applying ~1bit transformer technology to LoRA adapters allows us to reach comparable performance with…
15 min read