Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers
 
conference paper

Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers

Ehrmann, Maud  
•
Romanello, Matteo  
•
Bircher, Stefan
Show more
Jose, Joemon M.
•
Yilmaz, Emine
Show more
April 8, 2020
Advances in Information Retrieval. ECIR 2020
ECIR 2020 : 42nd European Conference on Information Retrieval

Since its introduction some twenty years ago, named entity (NE) processing has become an essential component of virtually any text mining application and has undergone major changes. Recently, two main trends characterise its developments: the adoption of deep learning architectures and the consideration of textual material originating from historical and cultural heritage collections. While the former opens up new opportunities, the latter introduces new challenges with heterogeneous, historical and noisy inputs. If NE processing tools are increasingly being used in the context of historical documents, performance values are below the ones on contemporary data and are hardly comparable. In this context, this paper introduces the CLEF 2020 Evaluation Lab HIPE (Identifying Historical People, Places and other Entities) on named entity recognition and linking on diachronic historical newspaper material in French, German and English. Our objective is threefold: strengthening the robustness of existing approaches on non-standard inputs, enabling performance comparison of NE processing on historical texts, and, in the long run, fostering efficient semantic indexing of historical documents in order to support scholarship on digital cultural heritage collections.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1007/978-3-030-45442-5_68
Author(s)
Ehrmann, Maud  
Romanello, Matteo  
Bircher, Stefan
Clematide, Simon
Editors
Jose, Joemon M.
•
Yilmaz, Emine
•
Magalhães, João
•
Castells, Pablo
•
Ferro, Nicola
•
Silva, Mário J.
•
Martins, Flávio
Date Issued

2020-04-08

Publisher

Springer International Publishing

Publisher place

Cham

Published in
Advances in Information Retrieval. ECIR 2020
ISBN of the book

978-3-030-45442-5

Total of pages

8

Series title/Series vol.

Lecture Notes in Computer Science; 12036

Volume

12036

Start page

524

End page

532

Subjects

Named entity processing

•

Text understanding

•

Information extraction

•

Historical newspapers

•

Digital Humanities

URL

Publisher link

https://doi.org/10.1007/978-3-030-45442-5_68
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
DHLAB  
Event nameEvent placeEvent date
ECIR 2020 : 42nd European Conference on Information Retrieval

Lisbon, Portugal

April 14-17, 2020

RelationURL/DOI

IsSupplementedBy

https://zenodo.org/record/3677171

IsSupplementedBy

https://zenodo.org/record/3604227

IsSupplementedBy

https://zenodo.org/deposit/3706857
Available on Infoscience
April 15, 2020
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/168173
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés