Abstract
Cross-lingual transfer of parsing models has been shown to work well for several closely-related languages, but predicting the success in other cases remains hard. Our study is a comprehensive analysis of the impact of linguistic distance on the transfer of UD parsers. As an alternative to syntactic typological distances extracted from URIEL, we propose three text-based feature spaces and show that they can be more precise predictors, especially on a more local scale, when only shorter distances are taken into account. Our analyses also reveal that the good coverage in typological databases is not among the factors that explain good transfer.
Originalsprog | Engelsk |
---|---|
Titel | Proceedings of the 26th Conference on Computational Natural Language Learning (CoNLL) |
Antal sider | 16 |
Vol/bind | 26 |
Forlag | Association for Computational Linguistics |
Publikationsdato | dec. 2023 |
Sider | 266-281 |
DOI | |
Status | Udgivet - dec. 2023 |
Begivenhed | The SIGNLL Conference on Computational Natural Language Learning - Abu Dhabi, United Arab Emirates Varighed: 7 dec. 2022 → 8 dec. 2022 Konferencens nummer: 26 https://conll.org/ |
Konference
Konference | The SIGNLL Conference on Computational Natural Language Learning |
---|---|
Nummer | 26 |
Land/Område | United Arab Emirates |
By | Abu Dhabi |
Periode | 07/12/2022 → 08/12/2022 |
Internetadresse |