DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers

Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review

Abstract

Large Language Models (LLMs) have seen widespread societal adoption. However, while they are able to interact with users in languages beyond English, they have been shown to lack cultural awareness, providing anglocentric or inappropriate responses for underrepresented language communities. To investigate this gap and disentangle linguistic versus cultural proficiency, we conduct the first cultural evaluation study for the mid-resource language of Danish, in which native speakers prompt different models to solve tasks requiring cultural awareness. Our analysis of the resulting 1,038 interactions from 63 demographically diverse participants highlights open challenges to cultural adaptation: in particular, currently employed automatically translated data are insufficient to train or measure cultural adaptation, whereas training on native-speaker data can more than double response acceptance rates. We release our study data as DaKultur, the first native Danish cultural awareness dataset.
Original language: English
Title of host publication: Proceedings of the 3rd Workshop on Cross-Cultural Considerations in NLP (C3NLP 2025)
Number of pages: 9
Place of Publication: Albuquerque, New Mexico
Publisher: Association for Computational Linguistics
Publication date: May 2025
Pages: 50-58
ISBN (Electronic): 979-8-89176-237-4
Publication status: Published - May 2025
Event: Workshop on Cross-Cultural Considerations - Albuquerque, United States
Duration: 4 May 2025 → …
Conference number: 3
https://c3nlp.github.io/2025/

Workshop

Workshop: Workshop on Cross-Cultural Considerations
Number: 3
Country/Territory: United States
City: Albuquerque
Period: 04/05/2025 → …
Keywords

  • Large Language Models
  • Cultural Awareness
  • Danish Language
  • Native-Speaker Data
  • DaKultur Dataset
