Skip to main navigation Skip to search Skip to main content

Emergent social conventions and collective bias in LLM populations

  • City, University of London

Research output: Journal Article or Conference Article in JournalJournal articleResearchpeer-review

Abstract

Social conventions are the backbone of social coordination, shaping how individuals form a group. As growing populations of artificial intelligence (AI) agents communicate through natural language, a fundamental question is whether they can bootstrap the foundations of a society. Here, we present experimental results that demonstrate the spontaneous emergence of universally adopted social conventions in decentralized populations of large language model (LLM) agents. We then show how strong collective biases can emerge during this process, even when agents exhibit no bias individually. Last, we examine how committed minority groups of adversarial LLM agents can drive social change by imposing alternative social conventions on the larger population. Our results show that AI systems can autonomously develop social conventions without explicit programming and have implications for designing AI systems that align, and remain aligned, with human values and societal goals.
Original languageEnglish
JournalScience Advances
Volume11
Issue number20
ISSN2375-2548
DOIs
Publication statusPublished - 14 May 2025

Fingerprint

Dive into the research topics of 'Emergent social conventions and collective bias in LLM populations'. Together they form a unique fingerprint.

Cite this