Abstract
Most research in Relation Extraction (RE) involves the English language, mainly due to the lack of multi-lingual resources. We propose Multi-CrossRE, the broadest multi-lingual dataset for RE, including 26 languages in addition to English, and covering six text domains. Multi-CrossRE is a machine translated version of CrossRE (Bassignana and Plank, 2022), with a sub-portion including more than 200 sentences in seven diverse languages checked by native speakers. We run a baseline model over the 26 new datasets and, as a sanity check, over the 26 back-translations to English. Results on the back-translated data are consistent with the ones on the original English CrossRE, indicating the high quality of the translation and of the resulting dataset.
| Original language | English |
|---|---|
| Title | Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa) |
| Publisher | University of Tartu Library |
| Publication date | 2023 |
| Pages | 80 - 85 |
| Status | Published - 2023 |
| Event | Nordic Conference on Computational Linguistics - Tórshavn, Faroe Islands. Duration: 22 May 2023 → 24 May 2023. Conference number: 24. https://www.nodalida2023.fo/ |
Conference
| Conference | Nordic Conference on Computational Linguistics |
|---|---|
| Number | 24 |
| Country/Territory | Faroe Islands |
| City | Tórshavn |
| Period | 22/05/2023 → 24/05/2023 |
| Internet address | |
Keywords
- Relation Extraction
- Multi-lingual Dataset
- Machine Translation
- CrossRE
- Text Domains
Fingerprint
Explore the research topics of 'Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction'. Together they form a unique fingerprint.