TY - JOUR
T1 - Multilingual projection for parsing truly low-resource languages
AU - Agic, Zeljko
AU - Johannsen, Anders Trærup
AU - Plank, Barbara
AU - Martinez Alonso, Hector
AU - Schluter, Natalie
AU - Søgaard, Anders
PY - 2016/7
Y1 - 2016/7
N2 - We propose a novel approach to cross-lingual part-of-speech tagging and dependency parsing for truly low-resource languages. Our annotation projection-based approach yields tagging and parsing models for over 100 languages. All that is needed are freely available parallel texts, and taggers and parsers for resource-rich languages. The empirical evaluation across 30 test languages shows that our method consistently provides top-level accuracies, close to established upper bounds, and outperforms several competitive baselines.
AB - We propose a novel approach to cross-lingual part-of-speech tagging and dependency parsing for truly low-resource languages. Our annotation projection-based approach yields tagging and parsing models for over 100 languages. All that is needed are freely available parallel texts, and taggers and parsers for resource-rich languages. The empirical evaluation across 30 test languages shows that our method consistently provides top-level accuracies, close to established upper bounds, and outperforms several competitive baselines.
KW - Cross-lingual part-of-speech tagging
KW - Dependency parsing
KW - Low-resource languages
KW - Annotation projection
KW - Parallel texts
KW - Cross-lingual part-of-speech tagging
KW - Dependency parsing
KW - Low-resource languages
KW - Annotation projection
KW - Parallel texts
M3 - Journal article
SN - 2307-387X
VL - 4
SP - 301
EP - 312
JO - Transactions of the Association for Computational Linguistics
JF - Transactions of the Association for Computational Linguistics
ER -