Abstract
We address the challenge of evaluating cross-lingual POS taggers in the absence of manually annotated test data. We propose and evaluate two dictionary-based metrics. On the tasks of accuracy prediction and system ranking, we show that these metrics are reliable enough to approximate test set-based evaluation, and at the same time lean enough to support assessment for truly low-resource languages.
Original language | English |
---|---|
Title | Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics |
Publisher | Association for Computational Linguistics |
Publication date | 2017 |
Pages | 248-253 |
ISBN (Electronic) | 978-1-945626-35-7 |
Status | Published - 2017 |
Event | The 15th Conference of the European Chapter of the Association for Computational Linguistics - Valencia, Spain. Duration: 3 Apr 2017 → 7 Apr 2017. http://eacl2017.org/ |
Conference
Conference | The 15th Conference of the European Chapter of the Association for Computational Linguistics |
---|---|
Country/Territory | Spain |
City | Valencia |
Period | 03/04/2017 → 07/04/2017 |
Internet address | http://eacl2017.org/ |
Keywords
- Cross-lingual POS tagging
- Dictionary-based evaluation metrics
- Accuracy prediction
- System ranking
- Low-resource languages assessment