Why Your RAG Is Not Reliable in a Production Environment

And how you should tune it properly

Ahmed Besbes
Towards Data Science
7 min read · Oct 12, 2023

--

With the rise of LLMs, the Retrieval Augmented Generation (RAG) framework has also gained popularity, since it makes it possible to build question-answering systems over your own data.

We’ve all seen those demos of chatbots conversing with PDFs or emails.

--

Medium Top Writer (+2M views) | I write about Python and productionizing ML code into scalable apps. Exclusive content here: https://thetechbuffet.substack.com/