Similarity Metrics in NLP

Euclidean distance, dot product, and cosine similarity

James Briggs
Towards Data Science
5 min readApr 14, 2021

--

Image by the author

When we convert language into a machine-readable format, the standard approach is to use dense vectors.

A neural network typically generates dense vectors. They allow us to convert words and sentences into high-dimensional vectors — organized so that each vector's geometric position can attribute…

--

--