conference paper

Diachronic Evaluation of NER Systems on Old Newspapers

Ehrmann, Maud
Colavizza, Giovanni
Rochat, Yannick
Kaplan, Frédéric
Editors: Dipper, Stephanie; Neubarth, Friedrich; Zinsmeister, Heike
2016
Proceedings of the 13th Conference on Natural Language Processing (KONVENS 2016)
13th Conference on Natural Language Processing (KONVENS 2016)

In recent years, many cultural institutions have undertaken large-scale newspaper digitization projects, and large amounts of historical text are being acquired through transcription or OCR. Beyond document preservation, the next step is to provide enhanced access to the content of these digital resources. In this regard, the processing of units that act as referential anchors, namely named entities (NE), is of particular importance. Yet applying standard NE tools to historical texts poses several challenges, and performance is often lower than on contemporary documents. This paper investigates the performance of different NE recognition tools applied to old newspapers by conducting a diachronic evaluation over seven time series taken from the archives of the Swiss newspaper Le Temps.

Type
conference paper
Author(s)
Ehrmann, Maud  
Colavizza, Giovanni  
Rochat, Yannick  
Kaplan, Frédéric
Editors
Dipper, Stephanie
Neubarth, Friedrich
Zinsmeister, Heike
Date Issued

2016

Publisher

Bochumer Linguistische Arbeitsberichte

Publisher place

Bochum, Germany

Published in
Proceedings of the 13th Conference on Natural Language Processing (KONVENS 2016)
Start page

97

End page

107

Subjects

named entities
evaluation
historical newspapers
digital humanities
natural language processing

URL

https://www.linguistics.rub.de/konvens16/pub/13_konvensproc.pdf
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
DHLAB  
Event name

13th Conference on Natural Language Processing (KONVENS 2016)

Event place

Bochum, Germany

Event date

September 19–21, 2016

Available on Infoscience
September 18, 2016
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/129441