
Abstract We present a general method for assessing the significance of threading scores. The threading score of a protein sequence threaded onto a given structure should be compared with the threading score distribution of random amino-acid sequences of the same length threaded onto the same structure; small p-values indicate significantly high scores. We claim that, owing to general properties of protein contact maps, this reference distribution is a Weibull extreme value distribution whose parameters depend on the threading method, the structure, the length of the query, and the random sequence simulation model used. These parameters can be estimated off-line from simulated sequence samples for different sequence lengths. They can then be interpolated at the exact length of a query, enabling quick computation of the p-value.
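The last two steps of the abstract (interpolating precomputed parameters at the query length, then reading off a p-value) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the table `PARAMS_BY_LENGTH` and its numbers are placeholders rather than values from the paper, linear interpolation is only one possible interpolation scheme, and the survival function uses the standard three-parameter Weibull form, which may differ from the exact parameterization the authors use.

```python
import math
from bisect import bisect_left

# Hypothetical off-line table: sequence length -> (shape, scale, location).
# Placeholder numbers for illustration only; not taken from the paper.
PARAMS_BY_LENGTH = {
    100: (2.0, 10.0, 0.0),
    200: (2.0, 20.0, 0.0),
}


def interpolate_params(length, table):
    """Linearly interpolate (shape, scale, location) at a given length.

    Lengths outside the tabulated range are clamped to the nearest entry.
    """
    lengths = sorted(table)
    if length <= lengths[0]:
        return table[lengths[0]]
    if length >= lengths[-1]:
        return table[lengths[-1]]
    i = bisect_left(lengths, length)
    lo, hi = lengths[i - 1], lengths[i]
    t = (length - lo) / (hi - lo)
    return tuple(a + t * (b - a) for a, b in zip(table[lo], table[hi]))


def weibull_p_value(score, shape, scale, location):
    """P(S >= score) under an assumed three-parameter Weibull distribution."""
    if score <= location:
        return 1.0
    return math.exp(-(((score - location) / scale) ** shape))


# Usage: p-value of a hypothetical threading score for a query of length 150.
shape, scale, location = interpolate_params(150, PARAMS_BY_LENGTH)
p = weibull_p_value(15.0, shape, scale, location)
```

Because the parameter table is computed once per structure and threading method, the per-query cost reduces to one interpolation and one exponential, which is the point of the off-line estimation described above.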
[SDV.BIBS] Life Sciences [q-bio]/Quantitative Methods [q-bio.QM], Models, Statistical, Protein Conformation, Computational Biology, Proteins, Markov Chains, 612, Sequence Analysis, Statistics, Stochastic Process, Computational Molecular Biology, Computer Simulation, Amino Acid Sequence, Sequence Alignment, Algorithms, Statistical Distributions
