
doi: 10.1002/prot.20160
pmid: 15281121
AbstractWe describe a new method for using neural networks to predict residue contact pairs in a protein. The main inputs to the neural network are a set of 25 measures of correlated mutation between all pairs of residues in two “windows” of size 5 centered on the residues of interest. While the individual pair‐wise correlations are a relatively weak predictor of contact, by training the network on windows of correlation the accuracy of prediction is significantly improved. The neural network is trained on a set of 100 proteins and then tested on a disjoint set of 1033 proteins of known structure. An average predictive accuracy of 21.7% is obtained taking the best L/2 predictions for each protein, where L is the sequence length. Taking the best L/10 predictions gives an average accuracy of 30.7%. The predictor is also tested on a set of 59 proteins from the CASP5 experiment. The accuracy is found to be relatively consistent across different sequence lengths, but to vary widely according to the secondary structure. Predictive accuracy is also found to improve by using multiple sequence alignments containing many sequences to calculate the correlations. Proteins 2004. © 2004 Wiley‐Liss, Inc.
Biochemistry & Molecular Biology, Secondary Structure, Neural Network, Biophysics, Protein Structure Prediction, 612, Protein Structure, Secondary, C1, Artificial Intelligence, Predictive Value of Tests, Protein Interaction Mapping, Amino Acids, Casp5, Correlated Mutation, 250503 Characterisation of Macromolecules, 239901 Biological Mathematics, 780105 Biological sciences, Residue Contacts, Cysteine Endopeptidases, Predicted Contact Map, Templates, Caspases, Neural Networks, Computer, Sequence Alignment, Mutations
Biochemistry & Molecular Biology, Secondary Structure, Neural Network, Biophysics, Protein Structure Prediction, 612, Protein Structure, Secondary, C1, Artificial Intelligence, Predictive Value of Tests, Protein Interaction Mapping, Amino Acids, Casp5, Correlated Mutation, 250503 Characterisation of Macromolecules, 239901 Biological Mathematics, 780105 Biological sciences, Residue Contacts, Cysteine Endopeptidases, Predicted Contact Map, Templates, Caspases, Neural Networks, Computer, Sequence Alignment, Mutations
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 59 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
