Abstract
We evaluate the output of 16 English-to-German MT systems with respect to the translation of pronouns in the context of the WMT 2018 competition. We work with a test suite specifically designed to assess system quality in various fine-grained categories known to be problematic. The main evaluation scores come from a semi-automatic process, combining automatic reference matching with extensive manual annotation of uncertain cases. We find that current NMT systems are good at translating pronouns with intra-sentential reference, but the inter-sentential cases remain difficult. NMT systems are also good at the translation of event pronouns, unlike systems from the phrase-based SMT paradigm. No single system performs best at translating all types of anaphoric pronouns, suggesting unexplained random effects influencing the translation of pronouns with NMT.
Original language | English |
---|---|
Title | WMT 2018 - 3rd Conference on Machine Translation, Proceedings of the Conference |
Publication date | 1 Nov 2018 |
ISBN (Print) | 9781948087810 |
DOI | |
Status | Published - 1 Nov 2018 |
Published externally | Yes |