Multilingual projection for parsing truly low-resource languages

Zeljko Agic, Anders Trærup Johannsen, Barbara Plank, Hector Martinez Alonso, Natalie Schluter, Anders Søgaard

Research output: Journal Article or Conference Article in JournalJournal articleResearchpeer-review

Abstract

We propose a novel approach to cross-lingual part-of-speech tagging and dependency parsing for truly low-resource languages. Our annotation projection-based approach yields tagging and parsing models for over 100 languages. All that is needed are freely available parallel texts, and taggers and parsers for resource-rich languages. The empirical evaluation across 30 test languages shows that our method consistently provides top-level accuracies, close to established upper bounds, and outperforms several competitive baselines.
Original languageEnglish
JournalTransactions of the Association for Computational Linguistics
Volume4
Pages (from-to)301-312
ISSN2307-387X
Publication statusPublished - Jul 2016
Externally publishedYes

Keywords

  • Cross-lingual part-of-speech tagging
  • Dependency parsing
  • Low-resource languages
  • Annotation projection
  • Parallel texts

Fingerprint

Dive into the research topics of 'Multilingual projection for parsing truly low-resource languages'. Together they form a unique fingerprint.

Cite this