Spring til hovednavigation Spring til søgning Spring til hovedindhold

IndicBART: A Pre-trained Model for Indic Natural Language Generation.

  • Raj Dabre
  • , Himani Shrotriya
  • , Anoop Kunchukuttan
  • , Ratish Puduppully
  • , Mitesh M. Khapra
  • , Pratyush Kumar
  • National Institute Of Information And Communications Technology, Japan
  • Indian Institute of Technology Madras
  • Microsoft India
  • University of Edinburgh

Publikation: Konference artikel i Proceeding eller bog/rapport kapitelKonferencebidrag i proceedingsForskningpeer review

Abstract

In this paper, we study pre-trained sequence-to-sequence models for a group of related languages, with a focus on Indic languages. We present IndicBART, a multilingual, sequence-to-sequence pre-trained model focusing on 11 Indic languages and English. IndicBART utilizes the orthographic similarity between Indic scripts to improve transfer learning between similar Indic languages. We evaluate IndicBART on two NLG tasks: Neural Machine Translation (NMT) and extreme summarization. Our experiments on NMT and extreme summarization show that a model specific to related languages like IndicBART is competitive with large pre-trained models like mBART50 despite being significantly smaller. It also performs well on very low-resource translation scenarios where languages are not included in pre-training or fine-tuning. Script sharing, multilingual training, and better utilization of limited model capacity contribute to the good performance of the compact IndicBART model.
OriginalsprogEngelsk
TitelFindings of the Association for Computational Linguistics: ACL 2022
ForlagAssociation for Computational Linguistics
Publikationsdato2022
Sider1849-1863
DOI
StatusUdgivet - 2022
Udgivet eksterntJa
BegivenhedConference on the Association for Computational Linguistics - Dublin, Irland
Varighed: 22 maj 202227 maj 2022
Konferencens nummer: 60

Konference

KonferenceConference on the Association for Computational Linguistics
Nummer60
Land/OmrådeIrland
ByDublin
Periode22/05/202227/05/2022

Fingeraftryk

Dyk ned i forskningsemnerne om 'IndicBART: A Pre-trained Model for Indic Natural Language Generation.'. Sammen danner de et unikt fingeraftryk.

Citationsformater