Project Details
Description
The goal of this project is to improve the role of demographic meta-data in machine learning benchmarks, focused on computer-aided diagnosis. In clinical settings, meta-data such as age of sex of patient are crucial to the diagnosis of that patients. However, when developing machine learning algorithms for diagnosis, such meta-data is often not taken into account. In fact, our preliminary study shows that 90% of diagnostic algorithms at a recent conference, do not mention such information, in part because medical image data for machine learning do not include meta-data. This can lead to (i) poor and/or biased algorithm predictions, and (ii) underexplored research questions. Addressing the missing meta-data problem is therefore crucial for responsible translation of diagnostic algorithms to real-life settings. This project will consist of a systematic review of the use of meta-data in computer-aided diagnosis benchmark, and investigate how to best include such meta-data when training algorithms.
| Acronym | MMC |
|---|---|
| Status | Finished |
| Effective start/end date | 01/10/2022 → 30/09/2025 |
Funding
- Independent Research Fund Denmark: DKK2,879,780.00
Fingerprint
Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.
Research output
- 3 Article in proceedings
-
Augmenting Chest X-ray Datasets with Non-Expert Annotations
Cheplygina, V., Damgaard, C., Eriksen, T. N., Juodelyte, D. & Jiménez-Sánchez, A., 17 Jul 2025, Medical Image Understanding and Analysis. Springer, p. 133-144 11 p. (Lecture Notes in Computer Science, Vol. 15916).Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
-
In the Picture: Medical Imaging Datasets, Artifacts, and their Living Review
Jiménez-Sánchez, A., Avlona, N.-R., de Boer, S., Campello, V. M., Feragen, A., Ferrante, E., Ganz, M., Gichoya, J. W., Gonzalez, C., Groefsema, S., Hering, A., Hulman, A., Joskowicz, L., Juodelyte, D., Kandemir, M., Kooi, T., Lérida, J. D. P., Li, L. Y., Pacheco, A. & Rädsch, T. & 9 others, , 23 Jun 2025, FAccT '25: Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency. New York: Association for Computing Machinery, p. 511-531 20 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
Open Access -
Copycats: the many lives of a publicly available medical imaging dataset
Jiménez Sánchez, A., Avlona, N.-R., Juodelyte, D., Sourget, T., Vang-Larsen, C., Rogers, A., Zajac, H. D. & Cheplygina, V., 26 Sept 2024, Advances in Neural Information Processing Systems 38 (NeurIPS 2024) : Datasets and Benchmarks Track. 2024 ed. Vol. NeurIPS. 22 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
Open Access
Datasets
-
NEATX: Non-Expert Annotations of Tubes in X-rays
Cheplygina, V. (Creator), Cathrine, D. (Creator), Eriksen, T. N. (Creator), Jiménez-Sánchez, A. (Creator) & Juodelyte, D. (Contributor), ZENODO, 28 Feb 2025
DOI: 10.5281/zenodo.14944064, https://zenodo.org/records/14944064
Dataset