Getting generative AI to provide references to its training data

Project: Research

Project Details

Description

The current generative AI systems such as ChatGPT are trained on existing texts, but they do not credit their sources. This project will develop a theoretical framework and novel methods for providing references to training data of large language models, which is a prerequisite to systems that are more transparent, trustworthy, and respectful of the rights of the original content creators. The grant will fund two Ph.D. students, one postdoc, and equipment.
AcronymPlagAIrism
StatusActive
Effective start/end date01/05/202430/06/2029

Collaborative partners

Funding

  • Villum Foundation: DKK6,989,227.00

Keywords

  • NLP
  • language models
  • data attribution
  • interpretability

Fingerprint

Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.