Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments.

Details

Ressource 1Download: 36088548_BIB_9C804159875D.pdf (1673.36 [Ko])
State: Public
Version: Final published version
License: CC BY-NC 4.0
Serval ID
serval:BIB_9C804159875D
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments.
Journal
Briefings in bioinformatics
Author(s)
Gable A.L., Szklarczyk D., Lyon D., Matias Rodrigues J.F., von Mering C.
ISSN
1477-4054 (Electronic)
ISSN-L
1467-5463
Publication state
Published
Issued date
20/09/2022
Peer-reviewed
Oui
Volume
23
Number
5
Pages
bbac355
Language
english
Notes
Publication types: Journal Article ; Research Support, Non-U.S. Gov't
Publication Status: ppublish
Abstract
A knowledge-based grouping of genes into pathways or functional units is essential for describing and understanding cellular complexity. However, it is not always clear a priori how and at what level of specificity functionally interconnected genes should be partitioned into pathways, for a given application. Here, we assess and compare nine existing and two conceptually novel functional classification systems, with respect to their discovery power and generality in gene set enrichment testing. We base our assessment on a collection of nearly 2000 functional genomics datasets provided by users of the STRING database. With these real-life and diverse queries, we assess which systems typically provide the most specific and complete enrichment results. We find many structural and performance differences between classification systems. Overall, the well-established, hierarchically organized pathway annotation systems yield the best enrichment performance, despite covering substantial parts of the human genome in general terms only. On the other hand, the more recent unsupervised annotation systems perform strongest in understudied areas and organisms, and in detecting more specific pathways, albeit with less informative labels.
Keywords
Databases, Factual, Databases, Genetic, Genomics/methods, Humans, Software, Gene Ontology, STRING, benchmark, functional annotation, gene set enrichment, pathways
Pubmed
Web of science
Open Access
Yes
Create date
20/09/2022 12:02
Last modification date
25/01/2024 7:41
Usage data