4 Simple Ways to Import Word and PDF Data into Python when Pandas Fails

A practical guide to import unstructured text/images data

Kat Li
Towards Data Science
5 min readMay 9, 2020

--

Being a part of the data science/analytics team, you’ll probably encounter many file types to import and analyze in Python. In the ideal world, all our data reside in the cloud-based databases (e.g., SQL, NoSQL) that are easy to query and extract. In the real world, however, we rarely get neat tabular data. Also, if we need additional data (being structured or unstructured) to augment the analysis, we will…

--

--