A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation

Meyer, Thomas; Popescu-Belis, Andrei; Liyanapathirana, Jeevanthi; Cartoni, Bruno

2011

Abstract

Inter-sentential dependencies, such as those carried by discourse connectives or pronouns, affect how these items are translated. These dependencies have classically been analyzed within complex theoretical frameworks, often monolingual ones, and the resulting fine-grained descriptions, although relevant to translation, are likely beyond the reach of statistical machine translation systems. Instead, we propose an approach to search for a minimal, feature-based characterization of translation divergences due to inter-sentential dependencies, in the case of discourse connectives and pronouns, based on contrastive analyses performed on the Europarl corpus. In addition, we show how to automatically assign labels to connectives and pronouns, and how to use them for statistical machine translation.

Details

Title: A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation

Author(s): Meyer, Thomas; Popescu-Belis, Andrei; Liyanapathirana, Jeevanthi; Cartoni, Bruno

Conference: Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?", Hamburg, Germany

Date: 2011

Keywords: Automated Disambiguation; discourse connectives; Machine Translation; Pronouns

Laboratories: LIDIAP

Record appears in:
Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL

Record creation date: 2013-12-19