EmbiText: Embracing Ambiguity by Annotation, Recognition and Generation of Pronominal Reference with Event-Entity Ambiguity

Publikation: Konference artikel i Proceeding eller bog/rapport kapitelKonferencebidrag i proceedingsForskningpeer review

Abstract

Consider the example “The bird sang the nursery rhyme beautifully. It made everyone in the room smile”. The pronoun ‘it’ here refers either to the bird or to the event of singing. This example is inherently ambiguous. It cannot be meaningfully disambiguated as an event or entity reference, as both readings result in the same text meaning. This study introduces a new dataset EMBITEXT to preserve ambiguity in the language by navigating through the ambiguity surrounding the pronominal reference to the entity or event. Oftentimes, ambiguity does not necessarily need to be resolved but is modelled carefully. Furthermore, this study explores the capacity of LLMs (Llama, Mistral, Gemini, Claude AI) to embrace ambiguity in generating text that exhibit referential ambiguity via an In-Context learning approach. To evaluate of the dataset, RoBERTa was finetuned on this data to model ambiguity while simultaneously distinguishing between entity or event references. Results demonstrate EmbiText’s capacity to advance the ongoing NLP research by modelling linguistic ambiguity in computational environments instead of fully disambiguating it, thereby retaining diverse interpretations where resolution may alter meaning.
OriginalsprogEngelsk
TitelProceedings of the 6th Workshop on Computational Approaches to Discourse, Context and Document-Level Inferences
Antal sider9
ForlagAssociation for Computational Linguistics
Publikationsdatonov. 2025
Sider157-165
ISBN (Elektronisk)979-8-89176-343-2
DOI
StatusUdgivet - nov. 2025
BegivenhedWorkshop on Computational Approaches to Discourse, Context and Document-Level Inferences - Suzhou International Expo Centre, Suzhou, Kina
Varighed: 8 nov. 20259 nov. 2025
Konferencens nummer: 6
https://2025.emnlp.org/program/workshops/?utm

Workshop

WorkshopWorkshop on Computational Approaches to Discourse, Context and Document-Level Inferences
Nummer6
LokationSuzhou International Expo Centre
Land/OmrådeKina
BySuzhou
Periode08/11/202509/11/2025
Internetadresse

Fingeraftryk

Dyk ned i forskningsemnerne om 'EmbiText: Embracing Ambiguity by Annotation, Recognition and Generation of Pronominal Reference with Event-Entity Ambiguity'. Sammen danner de et unikt fingeraftryk.

Citationsformater