Unsupervised Learning of Representations for Lexical Entailment Detection

Hug, Andreas

master thesis

September 4, 2018

Detecting lexical entailment plays a fundamental role in a variety of natural language processing tasks and is key to language understanding. Unsupervised methods remain important because lexical databases lack coverage in some domains and languages. Most previous approaches were either based on statistical hypotheses about specific entailment relations or tried to encode word relations in low-dimensional vector embeddings. This thesis builds upon one of the few approaches that intrinsically model entailment in a vector space. We then generalize this model by introducing an alternative, distributional representation for words that harnesses tools from optimal transport to define distance or entailment measures between such representations. We evaluate the models on hypernymy detection, where our distributional estimate significantly improves over the underlying model and even outperforms the state of the art on some datasets.

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/148605

Name: Master Thesis Report.pdf
Type: Publisher's version
Access type: restricted
Size: 456.79 KB
Format: Adobe PDF
Checksum (MD5): cd42bdd01b207df371c8ddd220dc40e4