Infoscience

conference paper

Problems and Procedures to Make Wordnet Data (Retro)Fit for a Multilingual Dictionary

Benjamin, Martin  
2016
Proceedings of the Eighth Global WordNet Conference
Global WordNet Conference 2016

The data compiled through many Wordnet projects can be a rich source of seed information for a multilingual dictionary. However, the original Princeton WordNet was not intended as a dictionary per se, and spawning other languages from it introduces inherent ambiguity that confounds precise inter-lingual linking. This paper discusses a new presentation of existing Wordnet data that displays joints (distance between predicted links) and substitution (degree of equivalence between confirmed pairs) as a two-tiered horizontal ontology. Improvements to make Wordnet data function as lexicography include term-specific English definitions where the topical synset glosses are inadequate, validation of mappings between each member of an English synset and each member of the synsets from other languages, removal of erroneous translation terms, creation of own-language definitions for the many languages where those are absent, and validation of predicted links between non-English pairs. The paper describes the current state and future directions of a system to crowdsource human review and expansion of Wordnet data, using gamification to build consensus-validated, dictionary-caliber data for languages now in the Global WordNet as well as new languages that do not have formal Wordnet projects of their own.

Type
conference paper
Author(s)
Benjamin, Martin  
Editors
Mititelu, Verginica • Forascu, Corina • Fellbaum, Christiane • Vossen, Piek
Date Issued

2016

Publisher place

Bucharest, Romania

Published in
Proceedings of the Eighth Global WordNet Conference
ISBN of the book

978-973-0-20728-6

Start page

26

End page

33

Subjects

Wordnet • multilingual lexicography • crowdsourcing • ontology • gamification

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LSIR  
Event name
Global WordNet Conference 2016
Event place
Bucharest, Romania
Event date
27-30 January 2016

Available on Infoscience
October 26, 2016
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/130769
Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved