Abstract
We evaluate the output of 16 English-to-German MT systems with respect to the translation of pronouns in the context of the WMT 2018 competition. We work with a test suite specifically designed to assess system quality in various fine-grained categories known to be problematic. The main evaluation scores come from a semi-automatic process, combining automatic reference matching with extensive manual annotation of uncertain cases. We find that current NMT systems are good at translating pronouns with intra-sentential reference, but the inter-sentential cases remain difficult. NMT systems are also good at the translation of event pronouns, unlike systems from the phrase-based SMT paradigm. No single system performs best at translating all types of anaphoric pronouns, suggesting unexplained random effects influencing the translation of pronouns with NMT.
Original language | English |
---|---|
Title of host publication | WMT 2018 - 3rd Conference on Machine Translation, Proceedings of the Conference |
Publication date | 1 Nov 2018 |
ISBN (Print) | 9781948087810 |
Publication status | Published - 1 Nov 2018 |
Externally published | Yes |