Cross-lingual tagger evaluation without test data

Zeljko Agic, Barbara Plank, Anders Søgaard

Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

Abstract

We address the challenge of cross-lingual POS tagger evaluation in absence of manually annotated test data. We put forth and evaluate two dictionary-based metrics. On the tasks of accuracy prediction and system ranking, we reveal that these metrics are reliable enough to approximate test set-based evaluation, and at the same time lean enough to support assessment for truly low-resource languages.
Original languageEnglish
Title of host publicationProceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics
PublisherAssociation for Computational Linguistics
Publication date2017
Pages248-253
ISBN (Electronic)978-1-945626-35-7
Publication statusPublished - 2017
EventThe 15th Conference of the European Chapter of the Association for Computational Linguistics - Valencia, Spain
Duration: 3 Apr 20177 Apr 2017
http://eacl2017.org/

Conference

ConferenceThe 15th Conference of the European Chapter of the Association for Computational Linguistics
Country/TerritorySpain
CityValencia
Period03/04/201707/04/2017
Internet address

Keywords

  • Cross-lingual POS tagging
  • Dictionary-based evaluation metrics
  • Accuracy prediction
  • System ranking
  • Low-resource languages assessment

Fingerprint

Dive into the research topics of 'Cross-lingual tagger evaluation without test data'. Together they form a unique fingerprint.

Cite this