
doi: 10.1002/rcm.1992
pmid: 15945033
Abstract This paper describes an algorithm to apply proteotypic peptide sequence libraries to protein identifications performed using tandem mass spectrometry (MS/MS). Proteotypic peptides are those peptides in a protein sequence that are most likely to be confidently observed by current MS‐based proteomics methods. Libraries of proteotypic peptide sequences were compiled from the Global Proteome Machine Database for Homo sapiens and Saccharomyces cerevisiae model species proteomes. These libraries were used to scan through collections of tandem mass spectra to discover which proteins were represented by the data sets, followed by detailed analysis of the spectra with the full protein sequences corresponding to the discovered proteotypic peptides. This algorithm (Proteotypic Peptide Profiling, or P3) resulted in sequence‐to‐spectrum matches comparable to those obtained by conventional protein identification algorithms using only full protein sequences, with a 20‐fold reduction in the time required to perform the identification calculations. The proteotypic peptide libraries, the open source code for the implementation of the search algorithm and a website for using the software have been made freely available. Approximately 4% of the residues in the H. sapiens proteome were required in the proteotypic peptide library to successfully identify proteins. Copyright © 2005 John Wiley & Sons, Ltd.
Chromatography, Gas, Saccharomyces cerevisiae Proteins, Proteome, Protein Hydrolysates, Molecular Sequence Data, Mass Spectrometry, Peptide Library, Databases, Genetic, Humans, Trypsin, Amino Acid Sequence, Peptides, Algorithms, Software
Chromatography, Gas, Saccharomyces cerevisiae Proteins, Proteome, Protein Hydrolysates, Molecular Sequence Data, Mass Spectrometry, Peptide Library, Databases, Genetic, Humans, Trypsin, Amino Acid Sequence, Peptides, Algorithms, Software
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 152 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
