The Long and Short of It: Proportion-Based Relevance to Capture Document Semantics End-to-End

Anthony Alcaraz
Towards Data Science
5 min read · Nov 25, 2023


Artificial intelligence software was used to enhance the grammar, flow, and readability of this article’s text.

Dominant search methods today typically rely on keyword matching or vector-space similarity to estimate the relevance of a document to a query. However, these techniques struggle when the query itself is an entire file, paper, or even a book.

Chief AI Officer & Architect: Builder of Neuro-Symbolic AI Systems @Fribl enhanced GenAI for HR https://topmate.io/alcaraz_anthony (Book a session)