Perspectives on tracking data reuse across biodata resources.
Details
Serval ID
serval:BIB_3F56A16EA193
Type
Article: article from journal or magazine.
Collection
Publications
Institution
Title
Perspectives on tracking data reuse across biodata resources.
Journal
Bioinformatics advances
Working group(s)
UniProt Consortium
Contributor(s)
Bateman A., Martin M.J., Orchard S., Magrane M., Ahmad S., Bowler-Barnett E.H., Bye-A-Jee H., Denny P., Dogan T., Ebenezer T., Fan J., da Costa Gonzales L.J., Hussein A., Ignatchenko A., Insana G., Ishtiaq R., Joshi V., Jyothi D., Kandasaamy S., Lock A., Luciani A., Luo J., Lussi Y., Raposo P., Rice D.L., Saidi R., Santos R., Speretta E., Stephenson J., Totoo P., Tyagi N., Vasudev P., Warner K., Zaru R., Wijerathne S., Ibrahim K.T., Kim M., Marin J., Bridge A.J., Aimo L., Argoud-Puy G., Auchincloss A.H., Axelsen K.B., Bansal P., Baratin D., Batista Neto T.M., Bolleman J.T., Boutet E., Breuza L., Gil B.C., Casals-Casas C., Coudert E., Cuche B., de Castro E., Estreicher A., Famiglietti M.L., Feuermann M., Gasteiger E., Gehant S., Gos A., Gruaz N., Hulo C., Hyka-Nouspikel N., Jungo F., Kerhornou A., Le Mercier P., Lieberherr D., Masson P., Morgat A., Pedruzzi I., Pilbout S., Pourcel L., Poux S., Pozzato M., Pruess M., Redaschi N., Rivoire C., Sigrist CJA, Sundaram S., Sveshnikova A., Wu C.H., Arighi C.N., Chen C., Chen Y., Huang H., Laiho K., Lehvaslaiho M., McGarvey P., Natale D.A., Ross K., Vinayaka C.R., Wang Y., Zhang J.
ISSN
2635-0041 (Electronic)
ISSN-L
2635-0041
Publication state
Published
Issued date
2024
Peer-reviewed
Yes
Volume
4
Number
1
Pages
vbae057
Language
English
Notes
Publication types: Editorial
Publication Status: epublish
Abstract
Data reuse is a common and vital practice in molecular biology and enables the knowledge gathered over recent decades to drive discovery and innovation in the life sciences. Much of this knowledge has been collated into molecular biology databases, such as UniProtKB, and these resources derive enormous value from sharing data among themselves. However, quantifying and documenting this kind of data reuse remains a challenge.
The article reports on a one-day virtual workshop hosted by the UniProt Consortium in March 2023, attended by representatives from biodata resources, experts in data management, and NIH program managers. Workshop discussions focused on strategies for tracking data reuse, best practices for reusing data, and the challenges associated with data reuse and tracking. Surveys and discussions showed that data reuse is widespread, but critical information for reproducibility is sometimes lacking. Challenges include costs of tracking data reuse, tensions between tracking data and open sharing, restrictive licenses, and difficulties in tracking commercial data use. Recommendations that emerged from the discussion include: development of standardized formats for documenting data reuse, education about the obstacles posed by restrictive licenses, and continued recognition by funding agencies that data management is a critical activity that requires dedicated resources.
Summaries of survey results are available at: https://docs.google.com/forms/d/1j-VU2ifEKb9C-sW6l3ATB79dgHdRk5v_lESv2hawnso/viewanalytics (survey of data providers) and https://docs.google.com/forms/d/18WbJFutUd7qiZoEzbOytFYXSfWFT61hVce0vjvIwIjk/viewanalytics (survey of users).
Pubmed
Open Access
Yes
Create date
10/05/2024 13:34
Last modification date
11/05/2024 7:51