TY - GEN
T1 - A Pronoun Test Suite Evaluation of the English–German MT Systems at WMT 2018
AU - Guillou, Liane
AU - Hardmeier, Christian
AU - Lapshinova-Koltunski, Ekaterina
AU - Loáiciga, Sharid
PY - 2018/11/1
Y1 - 2018/11/1
N2 - We evaluate the output of 16 English-to-German MT systems with respect to the translation of pronouns in the context of the WMT 2018 competition. We work with a test suite specifically designed to assess system quality in various fine-grained categories known to be problematic. The main evaluation scores come from a semi-automatic process, combining automatic reference matching with extensive manual annotation of uncertain cases. We find that current NMT systems are good at translating pronouns with intra-sentential reference, but the inter-sentential cases remain difficult. NMT systems are also good at the translation of event pronouns, unlike systems from the phrase-based SMT paradigm. No single system performs best at translating all types of anaphoric pronouns, suggesting unexplained random effects influencing the translation of pronouns with NMT.
AB - We evaluate the output of 16 English-to-German MT systems with respect to the translation of pronouns in the context of the WMT 2018 competition. We work with a test suite specifically designed to assess system quality in various fine-grained categories known to be problematic. The main evaluation scores come from a semi-automatic process, combining automatic reference matching with extensive manual annotation of uncertain cases. We find that current NMT systems are good at translating pronouns with intra-sentential reference, but the inter-sentential cases remain difficult. NMT systems are also good at the translation of event pronouns, unlike systems from the phrase-based SMT paradigm. No single system performs best at translating all types of anaphoric pronouns, suggesting unexplained random effects influencing the translation of pronouns with NMT.
KW - Machine Translation
KW - Pronoun Translation
KW - Anaphora Resolution
KW - Neural Machine Translation
KW - Semi-automatic Evaluation
UR - https://www.scopus.com/pages/publications/85119383114
U2 - 10.18653/v1/W18-6435
DO - 10.18653/v1/W18-6435
M3 - Article in proceedings
SN - 9781948087810
BT - WMT 2018 - 3rd Conference on Machine Translation, Proceedings of the Conference
ER -