Achieving Greater Self-Consistency in Large Language Models

Anthony Alcaraz
Towards Data Science
8 min readDec 1, 2023

--

Artificial intelligence software was used to enhance the grammar, flow, and readability of this article’s text.

When LLMs are used to evaluate qualities like the correctness, accuracy, or relevance of a piece of text, consistency is paramount. If an LLM exhibits inconsistent judgements, then its evaluations become unreliable and untrustworthy.

If an LLM evaluates the reasoning quality of arguments, but contradicts itself by rating an invalid argument as more logically sound than a perfectly valid one…

--

--

Chief AI Officer & Architect : Builder of Neuro-Symbolic AI Systems @Fribl enhanced GenAI for HR https://topmate.io/alcaraz_anthony (Book a session)