3DCoffee: combining protein sequences and structures within multiple sequence alignments.

Details

Serval ID
serval:BIB_32050
Type
Article: article from a journal or magazine.
Collection
Publications
Institution
Title
3DCoffee: combining protein sequences and structures within multiple sequence alignments.
Journal
Journal of Molecular Biology
Authors
O'Sullivan O., Suhre K., Abergel C., Higgins D.G., Notredame C.
ISSN
0022-2836
Publication status
Published
Publication date
2004
Volume
340
Issue
2
Pages
385-395
Language
English
Abstract
Most bioinformatics analyses require the assembly of a multiple sequence alignment. It has long been suspected that structural information can help to improve the quality of these alignments, yet the effect of combining sequences and structures has not been evaluated systematically. We developed 3DCoffee, a novel method for combining protein sequences and structures in order to generate high-quality multiple sequence alignments. 3DCoffee is based on TCoffee version 2.00, and uses a mixture of pairwise sequence alignments and pairwise structure comparison methods to generate multiple sequence alignments. We benchmarked 3DCoffee using a subset of HOMSTRAD, the collection of reference structural alignments. We found that combining TCoffee with the threading program Fugue makes it possible to improve the accuracy of our HOMSTRAD dataset by four percentage points when using one structure only per dataset. Using two structures yields an improvement of ten percentage points. The measures carried out on HOM39, a HOMSTRAD subset composed of distantly related sequences, show a linear correlation between multiple sequence alignment accuracy and the ratio of the number of provided structures to the total number of sequences. Our results suggest that in the case of distantly related sequences, a single structure may not be enough for computing an accurate multiple sequence alignment.
Keywords
Protein Conformation, Proteins/chemistry, Sequence Alignment
Record created
19/11/2007 10:01
Record last modified
20/08/2019 13:17