Use Machine Learning to detect errors in a dataset

How to Use Generalization Languages to find the needle in a haystack

Dimitris Poulopoulos
Towards Data Science
9 min readJan 18, 2021

--

Photo by Reno Laithienne on Unsplash

Corrupt data values in structured datasets are much more common than you would expect. The importance of detecting these errors early is crucial for the performance of downstream analytical tasks. Yet, manually examining each data point is neither efficient nor…

--

--

Machine Learning Engineer. I talk about AI, MLOps, and Python programming. More about me: www.dimpo.me