Database construction and Peptide identification strategies for proteogenomic studies on sequenced genomes.

Hernandez, C.; Waridel, P.; Quadroni, M.

doi:10.2174/1568026613666131204105652

Database construction and Peptide identification strategies for proteogenomic studies on sequenced genomes.

Details

Request a copy

Serval ID

serval:BIB_78D23B982235

Type

Article: article from journal or magazin.

Publication sub-type

Review (review): journal as complete as possible of one specific subject, written based on exhaustive analyses from published work.

Collection

Publications

Institution

UNIL/CHUV

Title

Database construction and Peptide identification strategies for proteogenomic studies on sequenced genomes.

Journal

Current Topics in Medicinal Chemistry

Author(s)

Hernandez C., Waridel P., Quadroni M.

ISSN

1873-4294 (Electronic)

ISSN-L

1568-0266

Publication state

Published

Issued date

2014

Volume

Number

Pages

425-434

Language

english

Abstract

Since the advent of high-throughput DNA sequencing technologies, the ever-increasing rate at which genomes have been published has generated new challenges notably at the level of genome annotation. Even if gene predictors and annotation softwares are more and more efficient, the ultimate validation is still in the observation of predicted gene product( s). Mass-spectrometry based proteomics provides the necessary high throughput technology to show evidences of protein presence and, from the identified sequences, confirmation or invalidation of predicted annotations. We review here different strategies used to perform a MS-based proteogenomics experiment with a bottom-up approach. We start from the strengths and weaknesses of the different database construction strategies, based on different genomic information (whole genome, ORF, cDNA, EST or RNA-Seq data), which are then used for matching mass spectra to peptides and proteins. We also review the important points to be considered for a correct statistical assessment of the peptide identifications. Finally, we provide references for tools used to map and visualize the peptide identifications back to the original genomic information.

Keywords

Proteogenomics, databases, bioinformatics, gene annotation, mass-spectrometry, proteomics

OAI-PMH

oai:serval.unil.ch:BIB_78D23B982235

DOI

10.2174/1568026613666131204105652

Pubmed

24304320

Web of science

000331379500019

Create date

14/02/2014 13:00