Automatic reference-based evaluation of pronoun translation misses the point

  • University of Edinburgh
  • Uppsala University

Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review

Abstract

We compare the performance of the APT and AutoPRF metrics for pronoun translation against a manually annotated dataset comprising human judgements as to the correctness of translations of the PROTEST test suite. Although there is some correlation with the human judgements, a range of issues limits the performance of the automated metrics. Instead, we recommend the use of semiautomatic metrics and test suites in place of fully automatic metrics.
Original language: English
Title of host publication: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018
Publication date: 2018
ISBN (Print): 9781948087841
Publication status: Published - 2018
Externally published: Yes

Keywords

  • pronoun translation evaluation
  • APT
  • AutoPRF
  • PROTEST test suite
  • semiautomatic metrics
