It-disambiguation and source-aware language models for cross-lingual pronoun prediction

Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review

Abstract

We present our systems for the WMT 2016 shared task on cross-lingual pronoun prediction. The main contribution is a classifier used to determine whether an instance of the ambiguous English pronoun “it” functions as an anaphoric, pleonastic or event reference pronoun. For the English-to-French task, the classifier is incorporated into an extended baseline, which takes the form of a source-aware language model. An implementation of the source-aware language model is also provided for each of the remaining language pairs.
Original language: English
Title of host publication: Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers
Publication date: 12 Aug 2016
Pages: 581-588
Publication status: Published - 12 Aug 2016
Externally published: Yes

Keywords

  • cross-lingual pronoun prediction
  • anaphora resolution
  • pleonastic pronoun
  • source-aware language model
  • machine translation
