Protein contact prediction using patterns of correlation

descriptionPublicationkeyboard_double_arrow_right Article 14 May 2004 Australia English Publisher:WileyJournal:Proteins: Structure, Function, and Bioinformatics, volume 56, pages 679-684 (issn: 0887-3585, eissn: 1097-0134,

Copyright policy )

Authors: Hamilton, Nicholas; Burrage, Kevin; Ragan, Mark A.; Huber, Thomas;

doi: 10.1002/prot.20160

pmid: 15281121

Protein contact prediction using patterns of correlation

- Summary
- Subjects
- Metrics

Abstract

AbstractWe describe a new method for using neural networks to predict residue contact pairs in a protein. The main inputs to the neural network are a set of 25 measures of correlated mutation between all pairs of residues in two “windows” of size 5 centered on the residues of interest. While the individual pair‐wise correlations are a relatively weak predictor of contact, by training the network on windows of correlation the accuracy of prediction is significantly improved. The neural network is trained on a set of 100 proteins and then tested on a disjoint set of 1033 proteins of known structure. An average predictive accuracy of 21.7% is obtained taking the best L/2 predictions for each protein, where L is the sequence length. Taking the best L/10 predictions gives an average accuracy of 30.7%. The predictor is also tested on a set of 59 proteins from the CASP5 experiment. The accuracy is found to be relatively consistent across different sequence lengths, but to vary widely according to the secondary structure. Predictive accuracy is also found to improve by using multiple sequence alignments containing many sequences to calculate the correlations. Proteins 2004. © 2004 Wiley‐Liss, Inc.

Country

Australia

Related Organizations

Keywords

Biochemistry & Molecular Biology, Secondary Structure, Neural Network, Biophysics, Protein Structure Prediction, 612, Protein Structure, Secondary, C1, Artificial Intelligence, Predictive Value of Tests, Protein Interaction Mapping, Amino Acids, Casp5, Correlated Mutation, 250503 Characterisation of Macromolecules, 239901 Biological Mathematics, 780105 Biological sciences, Residue Contacts, Cysteine Endopeptidases, Predicted Contact Map, Templates, Caspases, Neural Networks, Computer, Sequence Alignment, Mutations

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	59
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

59

Top 10%

bronze

Fields of Science (4) View all

engineering and technology

medical engineering

Fields of Science

engineering and technology

medical engineering

View all

Related to Research communities

Knowmad Institut