Abstract
Most research on implicit discourse relation identification has focused on written language, however, it is also crucial to understand these relations in spoken discourse. We introduce a novel method for implicit discourse relation identification across both text and speech, that allows us to extract examples of semantically equivalent pairs of implicit and explicit discourse markers, based on aligning speech+transcripts with subtitles in another language variant. We apply our method to Egyptian Arabic, resulting in a novel high-quality dataset of spoken implicit discourse relations. We present a comprehensive approach to modeling implicit discourse relation classification using audio and text data with a range of different models. We find that text-based models outperform audio-based models, but combining text and audio features can lead to enhanced performance.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 31st International Conference on Computational Linguistics (COLING) |
| Number of pages | 4 |
| Publisher | Association for Computational Linguistics |
| Publication date | Jan 2025 |
| Pages | 5425-5429 |
| Publication status | Published - Jan 2025 |
| Event | Computational Linguistics - United Arab Emirates, Abu Dhabi, United Arab Emirates Duration: 19 Jan 2025 → 24 Jan 2025 Conference number: 31 https://coling2025.org/ |
Conference
| Conference | Computational Linguistics |
|---|---|
| Number | 31 |
| Location | United Arab Emirates |
| Country/Territory | United Arab Emirates |
| City | Abu Dhabi |
| Period | 19/01/2025 → 24/01/2025 |
| Internet address |
Keywords
- Implicit discourse relations
- Multimodal discourse analysis
- Spoken language processing
- Egyptian Arabic dataset
- Text and audio fusion