Natural Language Processing

TF-IDF Simplified

A short introduction to TF-IDF vectorizer

Luthfi Ramadhan
Towards Data Science
4 min read · Jan 20, 2021


Photo by Jason Leung on Unsplash

Most machine learning algorithms are built on mathematics such as statistics, algebra, and calculus. They expect numerical input, typically a 2-dimensional array with rows as instances and columns as features. The problem with natural language is that the data is in the…

