Abstract
We describe Docent, an open-source decoder for statistical machine translation that breaks with the usual sentence-by-sentence paradigm and translates complete documents as units. By taking translation to the document level, our decoder can handle feature models with arbitrary discourse-wide dependencies and constitutes an essential infrastructure component in the quest for discourse-aware SMT models.
| Original language | English |
|---|---|
| Title | Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations |
| Publication date | 9 Aug 2013 |
| Status | Published - 9 Aug 2013 |
| Published externally | Yes |