Abstract
We address the challenge of cross-lingual POS tagger evaluation in absence of manually annotated test data. We put forth and evaluate two dictionary-based metrics. On the tasks of accuracy prediction and system ranking, we reveal that these metrics are reliable enough to approximate test set-based evaluation, and at the same time lean enough to support assessment for truly low-resource languages.
Original language | English |
---|---|
Title of host publication | Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics |
Publisher | Association for Computational Linguistics |
Publication date | 2017 |
Pages | 248-253 |
ISBN (Electronic) | 978-1-945626-35-7 |
Publication status | Published - 2017 |
Event | The 15th Conference of the European Chapter of the Association for Computational Linguistics - Valencia, Spain Duration: 3 Apr 2017 → 7 Apr 2017 http://eacl2017.org/ |
Conference
Conference | The 15th Conference of the European Chapter of the Association for Computational Linguistics |
---|---|
Country/Territory | Spain |
City | Valencia |
Period | 03/04/2017 → 07/04/2017 |
Internet address |
Keywords
- Cross-lingual POS tagging
- Dictionary-based evaluation metrics
- Accuracy prediction
- System ranking
- Low-resource languages assessment