Abstract
The recent emergence and adoption of Machine Learning technology, and specifically of Large Language Models, has drawn attention to the need for systematic and transparent management of language data. This work proposes an approach to global language data governance that attempts to organize data management amongst stakeholders, values, and rights. Our proposal is informed by prior work on distributed governance that accounts for human values and grounded by an international research collaboration that brings together researchers and practitioners from 60 countries. The framework we present is a multi-party international governance structure focused on language data, and incorporating technical and organizational tools needed to support its work.
| Original language | English |
|---|---|
| Title of host publication | 2022 ACM Conference on Fairness, Accountability, and Transparency |
| Number of pages | 17 |
| Place of Publication | New York, NY, USA |
| Publisher | Association for Computing Machinery |
| Publication date | 1 Jun 2022 |
| Pages | 2206-2222 |
| ISBN (Print) | 978-1-4503-9352-2 |
| DOIs | |
| Publication status | Published - 1 Jun 2022 |
| Event | Conference on Fairness, Accountability, and Transparency - Seoul, Korea, Democratic People's Republic of Duration: 21 Jun 2022 → 24 Jun 2022 https://dl.acm.org/doi/proceedings/10.1145/3531146 |
Conference
| Conference | Conference on Fairness, Accountability, and Transparency |
|---|---|
| Country/Territory | Korea, Democratic People's Republic of |
| City | Seoul |
| Period | 21/06/2022 → 24/06/2022 |
| Internet address |
| Series | FAccT '22 |
|---|
Keywords
- Machine Learning
- Large Language Models
- Language Data Governance
- Distributed Governance
- International Research Collaboration
Fingerprint
Dive into the research topics of 'Data Governance in the Age of Large-Scale Data-Driven Language Technology'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver