A Pronoun Test Suite Evaluation of the English–German MT Systems at WMT 2018

Liane Guillou, Christian Hardmeier, Ekaterina Lapshinova-Koltunski, Sharid Loáiciga

Publication: Conference article in proceedings or book/report chapter › Conference contribution in proceedings › Research › Peer-reviewed

Abstract

We evaluate the output of 16 English-to-German MT systems with respect to the translation of pronouns in the context of the WMT 2018 competition. We work with a test suite specifically designed to assess system quality in various fine-grained categories known to be problematic. The main evaluation scores come from a semi-automatic process, combining automatic reference matching with extensive manual annotation of uncertain cases. We find that current NMT systems are good at translating pronouns with intra-sentential reference, but the inter-sentential cases remain difficult. NMT systems are also good at the translation of event pronouns, unlike systems from the phrase-based SMT paradigm. No single system performs best at translating all types of anaphoric pronouns, suggesting unexplained random effects influencing the translation of pronouns with NMT.
Original language: English
Title: WMT 2018 - 3rd Conference on Machine Translation, Proceedings of the Conference
Publication date: 1 Nov. 2018
ISBN (Print): 9781948087810
Status: Published - 1 Nov. 2018
Externally published: Yes
