Diachronic Evaluation of NER Systems on Old Newspapers

Details

Resource 1 Download: 13_konvensproc.pdf (819.64 KB)
Status: Public
Version: Author's accepted manuscript
License: Not specified
Serval ID
serval:BIB_D9C0F97FA619
Type
Conference proceedings (part): original contribution to the scientific literature, published on the occasion of a scientific conference, in a proceedings volume, or in a special issue of a recognized journal.
Collection
Publications
Title
Diachronic Evaluation of NER Systems on Old Newspapers
Conference title
13th Conference on Natural Language Processing (KONVENS 2016), Bochum, Germany, September 19-21, 2016
Authors
Ehrmann Maud, Colavizza Giovanni, Rochat Yannick
Editorial status
Published
Publication date
2016
Peer-reviewed
Yes
Language
English
Abstract
In recent years, many cultural institutions have engaged in large-scale newspaper digitization projects, and large amounts of historical text are being acquired (via transcription or OCR). Beyond document preservation, the next step consists in providing enhanced access to the content of these digital resources. In this regard, the processing of units which act as referential anchors, namely named entities (NE), is of particular importance. Yet the application of standard NE tools to historical texts faces several challenges, and performance is often not as good as on contemporary documents. This paper investigates the performance of different NE recognition tools applied to old newspapers by conducting a diachronic evaluation over 7 time series taken from the archives of the Swiss newspaper Le Temps.
Record created
18/10/2019 12:06
Record last modified
19/07/2022 9:16