MUSI: an integrated system for identifying multiple specificity from very large peptide or nucleic acid data sets.

Kim, T.; Tyndel, M.S.; Huang, H.; Sidhu, S.S.; Bader, G.D.; Gfeller, D.; Kim, P.M.

doi:10.1093/nar/gkr1294

MUSI: an integrated system for identifying multiple specificity from very large peptide or nucleic acid data sets.

Détails

Télécharger: 22210894_BIB_55E64F2C6DDD.pdf (1414.50 [Ko])
Etat: Public
Version: Final published version

ID Serval

serval:BIB_55E64F2C6DDD

Type

Article: article d'un périodique ou d'un magazine.

Collection

Publications

Institution

Production externe

Titre

MUSI: an integrated system for identifying multiple specificity from very large peptide or nucleic acid data sets.

Périodique

Nucleic Acids Research

Auteur⸱e⸱s

Kim T., Tyndel M.S., Huang H., Sidhu S.S., Bader G.D., Gfeller D., Kim P.M.

ISSN

1362-4962 (Electronic)

ISSN-L

0305-1048

Statut éditorial

Publié

Date de publication

2012

Peer-reviewed

Oui

Volume

Numéro

Pages

e47

Langue

anglais

Résumé

Peptide recognition domains and transcription factors play crucial roles in cellular signaling. They bind linear stretches of amino acids or nucleotides, respectively, with high specificity. Experimental techniques that assess the binding specificity of these domains, such as microarrays or phage display, can retrieve thousands of distinct ligands, providing detailed insight into binding specificity. In particular, the advent of next-generation sequencing has recently increased the throughput of such methods by several orders of magnitude. These advances have helped reveal the presence of distinct binding specificity classes that co-exist within a set of ligands interacting with the same target. Here, we introduce a software system called MUSI that can rapidly analyze very large data sets of binding sequences to determine the relevant binding specificity patterns. Our pipeline provides two major advances. First, it can detect previously unrecognized multiple specificity patterns in any data set. Second, it offers integrated processing of very large data sets from next-generation sequencing machines. The results are visualized as multiple sequence logos describing the different binding preferences of the protein under investigation. We demonstrate the performance of MUSI by analyzing recent phage display data for human SH3 domains as well as microarray data for mouse transcription factors.

Mots-clé

Animals, Binding Sites, High-Throughput Nucleotide Sequencing, Humans, Ligands, Mice, Peptide Library, Peptides/chemistry, Position-Specific Scoring Matrices, Protein Interaction Domains and Motifs, Sequence Analysis, Protein, Software, Transcription Factors/metabolism, src Homology Domains

DOI

10.1093/nar/gkr1294

Pubmed

22210894

Web of science

000302312400008

Open Access

Oui

Création de la notice

15/12/2014 13:21

Dernière modification de la notice

20/08/2019 14:10

Données d'usage

SERVAL

serveur académique lausannois

MUSI: an integrated system for identifying multiple specificity from very large peptide or nucleic acid data sets.

Détails