000221391 001__ 221391
000221391 005__ 20180814184359.0
000221391 037__ $$aCONF
000221391 245__ $$aDiachronic Evaluation of NER Systems on Old Newspapers
000221391 260__ $$aBochum, Germany$$bBochumer Linguistische Arbeitsberichte$$c2016
000221391 269__ $$a2016
000221391 336__ $$aConference Papers
000221391 520__ $$aIn recent years, many cultural institutions have engaged in large-scale newspaper digitization projects, and large amounts of historical text are being acquired (via transcription or OCR). Beyond document preservation, the next step consists in providing enhanced access to the content of these digital resources. In this regard, the processing of units which act as referential anchors, namely named entities (NE), is of particular importance. Yet the application of standard NE tools to historical texts faces several challenges, and performance is often not as good as on contemporary documents. This paper investigates the performance of different NE recognition tools applied to old newspapers by conducting a diachronic evaluation over 7 time-series taken from the archives of the Swiss newspaper Le Temps.
000221391 6531_ $$anamed entities
000221391 6531_ $$aevaluation
000221391 6531_ $$ahistorical newspapers
000221391 6531_ $$adigital humanities
000221391 6531_ $$anatural language processing
000221391 700__ $$0248954$$aEhrmann, Maud$$g256249
000221391 700__ $$0248581$$aColavizza, Giovanni$$g242482
000221391 700__ $$0248846$$aRochat, Yannick$$g147407
000221391 700__ $$aKaplan, Frédéric
000221391 7112_ $$a13th Conference on Natural Language Processing (KONVENS 2016)$$cBochum, Germany$$dSeptember 19-21, 2016
000221391 7112_ $$aConference on Natural Language Processing$$cBochum, Germany$$dSeptember 19–21, 2016
000221391 720_1 $$aDipper, Stephanie$$eed.
000221391 720_1 $$aNeubarth, Friedrich$$eed.
000221391 720_1 $$aZinsmeister, Heike$$eed.
000221391 773__ $$q97-107$$tProceedings of the 13th Conference on Natural Language Processing (KONVENS 2016)
000221391 8560_ $$fmaud.ehrmann@epfl.ch
000221391 8564_ $$uhttps://www.linguistics.rub.de/konvens16/pub/13_konvensproc.pdf$$zURL
000221391 8564_ $$s839313$$uhttps://infoscience.epfl.ch/record/221391/files/13_konvensproc.pdf$$yn/a$$zn/a
000221391 909CO $$ooai:infoscience.tind.io:221391$$pCDH$$pconf
000221391 909C0 $$0252465$$pDHLAB$$xU12632
000221391 917Z8 $$x256249
000221391 937__ $$aEPFL-CONF-221391
000221391 973__ $$aEPFL$$rREVIEWED
000221391 980__ $$aCONF