Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Low-Rank Subspaces for Unsupervised Entity Linking
 
conference paper

Low-Rank Subspaces for Unsupervised Entity Linking

Arora, Akhil  
•
Garcia-Duran, Alberto  
•
West, Robert  
January 1, 2021
2021 Conference On Empirical Methods In Natural Language Processing (Emnlp 2021)
Conference on Empirical Methods in Natural Language Processing (EMNLP)

Entity linking is an important problem with many applications. Most previous solutions were designed for settings where annotated training data is available, which is, however, not the case in numerous domains. We propose a light-weight and scalable entity linking method, EIGENTHEMES, that relies solely on the availability of entity names and a referent knowledge base. EIGENTHEMES exploits the fact that the entities that are truly mentioned in a document (the "gold entities") tend to form a semantically dense subset of the set of all candidate entities in the document. Geometrically speaking, when representing entities as vectors via some given embedding, the gold entities tend to lie in a low-rank subspace of the full embedding space. EIGENTHEMES identifies this subspace using the singular value decomposition and scores candidate entities according to their proximity to the subspace. On the empirical front, we introduce multiple strong baselines that compare favorably to (and sometimes even outperform) the existing state of the art. Extensive experiments on benchmark datasets from a variety of real-world domains showcase the effectiveness of our approach.

  • Details
  • Metrics
Type
conference paper
DOI
10.18653/v1/2021.emnlp-main.634
Web of Science ID

WOS:000860727002012

Author(s)
Arora, Akhil  
•
Garcia-Duran, Alberto  
•
West, Robert  
Date Issued

2021-01-01

Publisher

ASSOC COMPUTATIONAL LINGUISTICS-ACL

Publisher place

Stroudsburg

Published in
2021 Conference On Empirical Methods In Natural Language Processing (Emnlp 2021)
ISBN of the book

978-1-955917-09-4

Start page

8037

End page

8054

Subjects

Computer Science, Artificial Intelligence

•

Computer Science, Interdisciplinary Applications

•

Linguistics

•

Computer Science

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
DLAB  
Event nameEvent placeEvent date
Conference on Empirical Methods in Natural Language Processing (EMNLP)

Punta Cana, DOMINICAN REP

Nov 07-11, 2021

Available on Infoscience
December 5, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/192932
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés