Measuring string similarity in BigQuery using SQL

Use Levenshtein distance to discover similar or duplicated values, clean your data, and more

Romain Granger
Towards Data Science
5 min readJun 13, 2022

--

Photo by Suraj Kardile on Unsplash

Using the Levenshtein distance method

This method can be used among others (Soundex, LIKE statement, Regexp) to perform string similarity or string matching in order to…

--

--

📊 Data Scientist | SQL, BigQuery, Python | Follow me to learn about SQL and data-related topics. Thank you so much for your support!