Spring til hovednavigation Spring til søgning Spring til hovedindhold

Unpacking Ambiguous Structure: A Dataset for Ambiguous Implicit Discourse Relations for English and Egyptian Arabic

  • Uppsala University

Publikation: Konference artikel i Proceeding eller bog/rapport kapitelKonferencebidrag i proceedingsForskningpeer review

Abstract

In this paper, we present principles of constructing and resolving ambiguity in implicit discourse relations. Following these principles, we created a dataset in both English and Egyptian Arabic that controls for semantic disambiguation, enabling the investigation of prosodic features in future work. In these datasets, examples are two-part sentences with an implicit discourse relation that can be ambiguously read as either causal or concessive, paired with two different preceding context sentences forcing either the causal or the concessive reading. We also validated both datasets by humans and language models (LMs) to study whether context can help humans or LMs resolve ambiguities of implicit relations and identify the intended relation. As a result, this task posed no difficulty for humans, but proved challenging for BERT/CamelBERT and ELECTRA/AraELECTRA models.
OriginalsprogEngelsk
TitelProceedings of the 4th Workshop on Computational Approaches to Discourse (CODI 2023)
Antal sider18
UdgivelsesstedCanada
ForlagAssociation for Computational Linguistics
Publikationsdato2023
Sider126-144
ISBN (Elektronisk) 978-1-959429-89-0
DOI
StatusUdgivet - 2023
BegivenhedWorkshop on Computational Approaches to Discourse - Toronto, Canada
Varighed: 9 jul. 202314 jul. 2023
Konferencens nummer: 4
https://sites.google.com/view/codi-2023/home

Workshop

WorkshopWorkshop on Computational Approaches to Discourse
Nummer4
Land/OmrådeCanada
ByToronto
Periode09/07/202314/07/2023
Internetadresse

Emneord

  • Implicit discourse relations
  • Semantic disambiguation
  • Prosodic features
  • Contextual ambiguity
  • Human vs. language model validation

Fingeraftryk

Dyk ned i forskningsemnerne om 'Unpacking Ambiguous Structure: A Dataset for Ambiguous Implicit Discourse Relations for English and Egyptian Arabic'. Sammen danner de et unikt fingeraftryk.

Citationsformater