Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry

descriptionPublicationkeyboard_double_arrow_right Article 27 Feb 2007 English Publisher:Springer Science and Business Media LLCJournal:Nature Methods, volume 4, pages 207-214 (issn: 1548-7091, eissn: 1548-7105,

Copyright policy )Funded by:NIH | New and Disruptive Techno...

Authors: Joshua E, Elias; Steven P, Gygi;

doi: 10.1038/nmeth1019

pmid: 17327847

Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry

- Summary
- Subjects
- Metrics

Abstract

Liquid chromatography and tandem mass spectrometry (LC-MS/MS) has become the preferred method for conducting large-scale surveys of proteomes. Automated interpretation of tandem mass spectrometry (MS/MS) spectra can be problematic, however, for a variety of reasons. As most sequence search engines return results even for 'unmatchable' spectra, proteome researchers must devise ways to distinguish correct from incorrect peptide identifications. The target-decoy search strategy represents a straightforward and effective way to manage this effort. Despite the apparent simplicity of this method, some controversy surrounds its successful application. Here we clarify our preferred methodology by addressing four issues based on observed decoy hit frequencies: (i) the major assumptions made with this database search strategy are reasonable; (ii) concatenated target-decoy database searches are preferable to separate target and decoy database searches; (iii) the theoretical error associated with target-decoy false positive (FP) rate measurements can be estimated; and (iv) alternate methods for constructing decoy databases are similarly effective once certain considerations are taken into account.

Related Organizations

Harvard University
United States

Keywords

Proteome, Molecular Sequence Data, Information Storage and Retrieval, Reproducibility of Results, Peptide Mapping, Sensitivity and Specificity, Mass Spectrometry, Sequence Analysis, Protein, Database Management Systems, Amino Acid Sequence, Databases, Protein, Sequence Alignment, Algorithms

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4K
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 0.01%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 0.01%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 0.01%