Abstract
Cross-lingual transfer of parsing models has been shown to work well for several closely-related languages, but predicting the success in other cases remains hard. Our study is a comprehensive analysis of the impact of linguistic distance on the transfer of UD parsers. As an alternative to syntactic typological distances extracted from URIEL, we propose three text-based feature spaces and show that they can be more precise predictors, especially on a more local scale, when only shorter distances are taken into account. Our analyses also reveal that the good coverage in typological databases is not among the factors that explain good transfer.
Original language | English |
---|---|
Title of host publication | Proceedings of the 26th Conference on Computational Natural Language Learning (CoNLL) |
Number of pages | 16 |
Volume | 26 |
Publisher | Association for Computational Linguistics |
Publication date | Dec 2023 |
Pages | 266-281 |
DOIs | |
Publication status | Published - Dec 2023 |
Event | The SIGNLL Conference on Computational Natural Language Learning - Abu Dhabi, United Arab Emirates Duration: 7 Dec 2022 → 8 Dec 2022 Conference number: 26 https://conll.org/ |
Conference
Conference | The SIGNLL Conference on Computational Natural Language Learning |
---|---|
Number | 26 |
Country/Territory | United Arab Emirates |
City | Abu Dhabi |
Period | 07/12/2022 → 08/12/2022 |
Internet address |
Keywords
- Natural Language Processing