On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers

Tanja Samardžić, Ximena Gutierrez-Vasques, Rob van der Goot, Max Müller-Eberstein, Olga Pelloni, Barbara Plank

Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

Abstract

Cross-lingual transfer of parsing models has been shown to work well for several closely-related languages, but predicting the success in other cases remains hard. Our study is a comprehensive analysis of the impact of linguistic distance on the transfer of UD parsers. As an alternative to syntactic typological distances extracted from URIEL, we propose three text-based feature spaces and show that they can be more precise predictors, especially on a more local scale, when only shorter distances are taken into account. Our analyses also reveal that the good coverage in typological databases is not among the factors that explain good transfer.
Original languageEnglish
Title of host publicationProceedings of the 26th Conference on Computational Natural Language Learning (CoNLL)
Number of pages16
Volume26
PublisherAssociation for Computational Linguistics
Publication dateDec 2023
Pages266-281
DOIs
Publication statusPublished - Dec 2023
EventThe SIGNLL Conference on Computational Natural Language Learning - Abu Dhabi, United Arab Emirates
Duration: 7 Dec 20228 Dec 2022
Conference number: 26
https://conll.org/

Conference

ConferenceThe SIGNLL Conference on Computational Natural Language Learning
Number26
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period07/12/202208/12/2022
Internet address

Keywords

  • Natural Language Processing

Fingerprint

Dive into the research topics of 'On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers'. Together they form a unique fingerprint.

Cite this