Sharing sensitive data in life sciences: an overview of centralized and federated approaches.

Details

Serval ID
serval:BIB_05960E7652A1
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
Sharing sensitive data in life sciences: an overview of centralized and federated approaches.
Journal
Briefings in bioinformatics
Author(s)
Rujano M.A., Boiten J.W., Ohmann C., Canham S., Contrino S., David R., Ewbank J., Filippone C., Connellan C., Custers I., van Nuland R., Mayrhofer M.T., Holub P., Álvarez E.G., Bacry E., Hughes N., Freeberg M.A., Schaffhauser B., Wagener H., Sánchez-Pla A., Bertolini G., Panagiotopoulou M.
ISSN
1477-4054 (Electronic)
ISSN-L
1467-5463
Publication state
Published
Issued date
23/05/2024
Peer-reviewed
Oui
Volume
25
Number
4
Language
english
Notes
Publication types: Journal Article ; Review
Publication Status: ppublish
Abstract
Biomedical data are generated and collected from various sources, including medical imaging, laboratory tests and genome sequencing. Sharing these data for research can help address unmet health needs, contribute to scientific breakthroughs, accelerate the development of more effective treatments and inform public health policy. Due to the potential sensitivity of such data, however, privacy concerns have led to policies that restrict data sharing. In addition, sharing sensitive data requires a secure and robust infrastructure with appropriate storage solutions. Here, we examine and compare the centralized and federated data sharing models through the prism of five large-scale and real-world use cases of strategic significance within the European data sharing landscape: the French Health Data Hub, the BBMRI-ERIC Colorectal Cancer Cohort, the federated European Genome-phenome Archive, the Observational Medical Outcomes Partnership/OHDSI network and the EBRAINS Medical Informatics Platform. Our analysis indicates that centralized models facilitate data linkage, harmonization and interoperability, while federated models facilitate scaling up and legal compliance, as the data typically reside on the data generator's premises, allowing for better control of how data are shared. This comparative study thus offers guidance on the selection of the most appropriate sharing strategy for sensitive datasets and provides key insights for informed decision-making in data sharing efforts.
Keywords
Information Dissemination, Humans, Biological Science Disciplines, Medical Informatics/methods, FAIR principles, GDPR, biomedical data, centralized model, data sharing, federated model
Pubmed
Web of science
Open Access
Yes
Create date
14/06/2024 14:29
Last modification date
20/08/2024 6:23
Usage data