Emergent social conventions and collective bias in LLM populations

Publikation: Artikel i tidsskrift og konference artikel i tidsskriftTidsskriftartikelForskningpeer review

Abstract

Social conventions are the backbone of social coordination, shaping how individuals form a group. As growing populations of artificial intelligence (AI) agents communicate through natural language, a fundamental question is whether they can bootstrap the foundations of a society. Here, we present experimental results that demonstrate the spontaneous emergence of universally adopted social conventions in decentralized populations of large language model (LLM) agents. We then show how strong collective biases can emerge during this process, even when agents exhibit no bias individually. Last, we examine how committed minority groups of adversarial LLM agents can drive social change by imposing alternative social conventions on the larger population. Our results show that AI systems can autonomously develop social conventions without explicit programming and have implications for designing AI systems that align, and remain aligned, with human values and societal goals.
OriginalsprogEngelsk
TidsskriftScience Advances
ISSN2375-2548
DOI
StatusUdgivet - maj 2025

Fingeraftryk

Dyk ned i forskningsemnerne om 'Emergent social conventions and collective bias in LLM populations'. Sammen danner de et unikt fingeraftryk.

Citationsformater