Uncovering hidden duplicated content in public transcriptomics data.

Details

Resource 1: BIB_230AE9FE1902.P001.pdf (85.86 KB)
Status: Public
Version: author's version
Serval ID
serval:BIB_230AE9FE1902
Type
Article: article in a journal or magazine.
Collection
Publications
Title
Uncovering hidden duplicated content in public transcriptomics data.
Journal
Database
Authors
Rosikiewicz M., Comte A., Niknejad A., Robinson-Rechavi M., Bastian F.B.
ISSN
1758-0463 (Electronic)
Publication status
Published
Publication date
2013
Peer-reviewed
Yes
Volume
2013
Pages
bat010
Language
English
Notes
Database URL: http://bgee.unil.ch/
Abstract
As part of the development of the database Bgee (a dataBase for Gene Expression Evolution), we annotate and analyse expression data from different types and different sources, notably Affymetrix data from GEO and ArrayExpress, and RNA-Seq data from SRA. During our quality control procedure, we have identified duplicated content in GEO and ArrayExpress, affecting ∼14% of our data: fully or partially duplicated experiments from independent data submissions, Affymetrix chips reused in several experiments, or reused within an experiment. We present here the procedure that we have established to filter such duplicates from Affymetrix data, and our procedure to identify future potential duplicates in RNA-Seq data. Database URL: http://bgee.unil.ch/
Open Access
Yes
Record created
20/02/2013 18:46
Record last modified
20/08/2019 14:00