An improved version of this approach published in Jan 2022 describes how to scale it to a large number of entity types (e.g. 68 entity types spanning the domain of biology and PHI entities such as person, location, organization).
In natural language processing, identifying entities of interest (NER) in a sentence such as person, location, organization etc. requires labeled data…