Multimodal Extraction and Recognition of Arabic Implicit Discourse Relations

Ahmed Ruby, Christian Hardmeier, Sara Stymne

Publikation: Konference artikel i Proceeding eller bog/rapport kapitelKonferencebidrag i proceedingsForskningpeer review

Abstract

Most research on implicit discourse relation identification has focused on written language, however, it is also crucial to understand these relations in spoken discourse. We introduce a novel method for implicit discourse relation identification across both text and speech, that allows us to extract examples of semantically equivalent pairs of implicit and explicit discourse markers, based on aligning speech+transcripts with subtitles in another language variant. We apply our method to Egyptian Arabic, resulting in a novel high-quality dataset of spoken implicit discourse relations. We present a comprehensive approach to modeling implicit discourse relation classification using audio and text data with a range of different models. We find that text-based models outperform audio-based models, but combining text and audio features can lead to enhanced performance.
OriginalsprogEngelsk
TitelProceedings of the 31st International Conference on Computational Linguistics (COLING)
Antal sider4
ForlagAssociation for Computational Linguistics
Publikationsdatojan. 2025
Sider5425-5429
StatusUdgivet - jan. 2025
Begivenhed31st International Conference on Computational Linguistics - Abu Dhabi, United Arab Emirates
Varighed: 19 jan. 202524 jan. 2025
Konferencens nummer: 31

Konference

Konference31st International Conference on Computational Linguistics
Nummer31
Land/OmrådeUnited Arab Emirates
ByAbu Dhabi
Periode19/01/202524/01/2025

Fingeraftryk

Dyk ned i forskningsemnerne om 'Multimodal Extraction and Recognition of Arabic Implicit Discourse Relations'. Sammen danner de et unikt fingeraftryk.

Citationsformater