Charton, E., Gagnon, M., & Ozell, B. (2011). Automatic semantic web annotation of named entities. In Proceedings of the 24th Canadian conference on Advances in artificial intelligence (pp. 74—85). Berlin, Heidelberg: Springer-Verlag.
SummaryThis article presents a method for identifying named entities in text and linking them to a semantic knowledge base. In contrast to named entity recognition, which focuses on identifying the entity type (i.e. organization, person, location, etc), named entity linking determines which entity (i.e. individual) are mentioned in the text.
MethodThe authors link entities to Wikipedia using the following description for each Wikipedia entity:
- surface forms (i.e. names that refer to this entity)
- entity description (i.e. the entity's context) - the tf/idf values are computed for each word occuring in these descriptions
ExperimentsThe authors applied their approach to
- the French ESTER 2 corpus
- the Wall Street Journal (WSJ) corpus from the CoNLL Shared Task 2008.
- apply the annotator to provide tentative annotations
- manually remove or correct wrong semantic links