Skip to main navigation Skip to search Skip to main content

Docent: A Document-Level Decoder for Phrase-Based Statistical Machine Translation

  • Uppsala University

Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

Abstract

We describe Docent, an open-source decoder for statistical machine translation that breaks with the usual sentence-by-sentence paradigm and translates complete documents as units. By taking translation to the document level, our decoder can handle feature models with arbitrary discourse-wide dependencies and constitutes an essential infrastructure component in the quest for discourse-aware SMT models.
Original languageEnglish
Title of host publicationProceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations
Publication date9 Aug 2013
Publication statusPublished - 9 Aug 2013
Externally publishedYes

Keywords

  • Document-level translation
  • Statistical machine translation
  • Discourse-aware SMT
  • Feature models for SMT
  • Open-source decoder

Fingerprint

Dive into the research topics of 'Docent: A Document-Level Decoder for Phrase-Based Statistical Machine Translation'. Together they form a unique fingerprint.

Cite this