Cross-lingual tagger evaluation without test data

Zeljko Agic, Barbara Plank, Anders Søgaard

Publication: Conference article in proceedings or book/report chapter › Conference contribution in proceedings › Research › Peer-reviewed

Abstract

We address the challenge of cross-lingual POS tagger evaluation in the absence of manually annotated test data. We put forth and evaluate two dictionary-based metrics. On the tasks of accuracy prediction and system ranking, we reveal that these metrics are reliable enough to approximate test set-based evaluation, and at the same time lean enough to support assessment for truly low-resource languages.
Original language: English
Title: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics
Publisher: Association for Computational Linguistics
Publication date: 2017
Pages: 248-253
ISBN (Electronic): 978-1-945626-35-7
Status: Published - 2017
Event: The 15th Conference of the European Chapter of the Association for Computational Linguistics - Valencia, Spain
Duration: 3 Apr 2017 to 7 Apr 2017
http://eacl2017.org/

Conference

Conference: The 15th Conference of the European Chapter of the Association for Computational Linguistics
Country/Territory: Spain
City: Valencia
Period: 3 Apr 2017 to 7 Apr 2017
Keywords

  • Cross-lingual POS tagging
  • Dictionary-based evaluation metrics
  • Accuracy prediction
  • System ranking
  • Low-resource languages assessment
