Fuzzy String Matching Algorithms

Levenshtein, Phonetic

Arun Jagota
Towards Data Science
7 min read · Dec 23, 2021



Often the same entity may be expressed as different strings. For instance, the first name of the same person may plausibly be written as Kathy or Cathy, or as Jonathan or Jonahtan.
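To make the name pairs above concrete, here is a minimal sketch of the Levenshtein (edit) distance named in the subtitle. It assumes unit cost for each insertion, deletion, and substitution; the function name `levenshtein` is illustrative and not taken from the article.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn string a into string b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or match)
            ))
        prev = curr
    return prev[-1]


print(levenshtein("Kathy", "Cathy"))        # 1: one substitution (K -> C)
print(levenshtein("Jonathan", "Jonahtan"))  # 2: the swapped "th" costs two edits
```

Under this plain definition a transposition such as "th" to "ht" counts as two edits; variants that treat an adjacent swap as a single edit exist but are not what this sketch implements.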

Matching two strings, that is, inferring that they are plausible expressions of the same entity, has several use cases, such as in web search and in deduping databases of…


PhD, Computer Science, neural nets. 14+ years in industry: data science algos developer. 24+ patents issued. 50 academic pubs. Blogs on ML/data science topics.