Phylogenetic and functional assessment of orthologs inference projects and methods.

Altenhoff, A.M.; Dessimoz, C.

doi:10.1371/journal.pcbi.1000262

Phylogenetic and functional assessment of orthologs inference projects and methods.

Détails

Demande d'une copie

ID Serval

serval:BIB_735B6D029A2B

Type

Article: article d'un périodique ou d'un magazine.

Collection

Publications

Institution

Production externe

Titre

Phylogenetic and functional assessment of orthologs inference projects and methods.

Périodique

PLoS computational biology

Auteur⸱e⸱s

Altenhoff A.M., Dessimoz C.

ISSN

1553-7358 (Electronic)

ISSN-L

1553-734X

Statut éditorial

Publié

Date de publication

01/2009

Peer-reviewed

Oui

Volume

Numéro

Pages

e1000262

Langue

anglais

Notes

Publication types: Journal Article
Publication Status: ppublish

Résumé

Accurate genome-wide identification of orthologs is a central problem in comparative genomics, a fact reflected by the numerous orthology identification projects developed in recent years. However, only a few reports have compared their accuracy, and indeed, several recent efforts have not yet been systematically evaluated. Furthermore, orthology is typically only assessed in terms of function conservation, despite the phylogeny-based original definition of Fitch. We collected and mapped the results of nine leading orthology projects and methods (COG, KOG, Inparanoid, OrthoMCL, Ensembl Compara, Homologene, RoundUp, EggNOG, and OMA) and two standard methods (bidirectional best-hit and reciprocal smallest distance). We systematically compared their predictions with respect to both phylogeny and function, using six different tests. This required the mapping of millions of sequences, the handling of hundreds of millions of predicted pairs of orthologs, and the computation of tens of thousands of trees. In phylogenetic analysis or in functional analysis where high specificity is required, we find that OMA and Homologene perform best. At lower functional specificity but higher coverage level, OrthoMCL outperforms Ensembl Compara, and to a lesser extent Inparanoid. Lastly, the large coverage of the recent EggNOG can be of interest to build broad functional grouping, but the method is not specific enough for phylogenetic or detailed function analyses. In terms of general methodology, we observe that the more sophisticated tree reconstruction/reconciliation approach of Ensembl Compara was at times outperformed by pairwise comparison approaches, even in phylogenetic tests. Furthermore, we show that standard bidirectional best-hit often outperforms projects with more complex algorithms. First, the present study provides guidance for the broad community of orthology data users as to which database best suits their needs. Second, it introduces new methodology to verify orthology. And third, it sets performance standards for current and future approaches.

Mots-clé

Animals, Databases, Genetic, Genetic Speciation, Genomics/methods, Genomics/standards, Humans, Models, Genetic, Phylogeny, Physiology, Comparative, Sensitivity and Specificity, Species Specificity

DOI

10.1371/journal.pcbi.1000262

Pubmed

19148271

Web of science

000263924300016

Open Access

Oui

Création de la notice

02/09/2015 9:16

Dernière modification de la notice

06/03/2024 10:33

Données d'usage

SERVAL

serveur académique lausannois

Phylogenetic and functional assessment of orthologs inference projects and methods.

Détails