Uncovering hidden duplicated content in public transcriptomics data.
Details
Download: BIB_230AE9FE1902.P001.pdf (85.86 KB)
State: Public
Version: author
Serval ID
serval:BIB_230AE9FE1902
Type
Article: article from journal or magazine.
Collection
Publications
Institution
Title
Uncovering hidden duplicated content in public transcriptomics data.
Journal
Database
ISSN
1758-0463 (Electronic)
Publication state
Published
Issued date
2013
Peer-reviewed
Yes
Volume
2013
Pages
bat010
Language
English
Notes
Database URL: http://bgee.unil.ch/
Abstract
As part of the development of the database Bgee (a dataBase for Gene Expression Evolution), we annotate and analyse expression data of different types and from different sources, notably Affymetrix data from GEO and ArrayExpress, and RNA-Seq data from SRA. During our quality control procedure, we have identified duplicated content in GEO and ArrayExpress, affecting ∼14% of our data: fully or partially duplicated experiments from independent data submissions, Affymetrix chips reused in several experiments, or reused within an experiment. We present here the procedure that we have established to filter such duplicates from Affymetrix data, and our procedure to identify future potential duplicates in RNA-Seq data. Database URL: http://bgee.unil.ch/
PubMed
Web of Science
Open Access
Yes
Create date
20/02/2013 17:46
Last modification date
20/08/2019 13:00