Confusion matrix for six-way classification (TF-IDF with Naive Bayes classifier)

A complete NLP classification pipeline in scikit-learn

Go from corpus to classification with this full-on guide for a natural language processing classification pipeline.

Louis de Bruijn
6 min readMar 13, 2020

--

What we’ll cover in this story:

  • Reading a corpus
  • Basic script structure including logging, argparse and ifmain.

--

--