Downloads provided by UsageCounts
Abstract

Motivation: Proteins containing tandem repeats (TRs) are abundant, frequently fold into elongated non-globular structures and perform vital functions. A number of computational tools have been developed to detect TRs in protein sequences. The blurred boundary between imperfect TR motifs and non-repetitive sequences has created a need to validate the detected TRs.

Results: Tally-2.0 is a scoring tool based on a machine learning (ML) approach that validates the results of TR detection. It was upgraded by using improved training datasets and additional ML features. Tally-2.0 achieves 93% sensitivity, 83% specificity and an area under the receiver operating characteristic curve of 95%.

Availability and implementation: Tally-2.0 is available as a web tool and as a standalone application, published under Apache License 2.0, at https://bioinfo.crbm.cnrs.fr/index.php?route=tools&tool=27. It is supported on Linux. Source code is available upon request.

Supplementary information: Supplementary data are available at Bioinformatics online.
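The sensitivity, specificity and area under the ROC curve quoted in the abstract are standard binary-classification metrics. The following is a minimal sketch of how such figures are computed in general; it is not the Tally-2.0 code. The function names, the toy labels and scores, and the 0.5 threshold are all hypothetical and chosen only for illustration. Here label 1 stands for a genuine TR and label 0 for a non-repetitive sequence.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    labels: Sequence[int], scores: Sequence[float], threshold: float
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) when scores >= threshold are called positive."""
    tp = fn = tn = fp = 0
    for label, score in zip(labels, scores):
        predicted_positive = score >= threshold
        if label == 1:
            if predicted_positive:
                tp += 1
            else:
                fn += 1
        else:
            if predicted_positive:
                fp += 1
            else:
                tn += 1
    # Sensitivity: fraction of true positives recovered.
    # Specificity: fraction of true negatives correctly rejected.
    return tp / (tp + fn), tn / (tn + fp)


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via the pairwise rank formulation.

    Equals the probability that a randomly chosen positive scores higher
    than a randomly chosen negative, with ties counted as one half.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical toy data, not taken from the Tally-2.0 evaluation.
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]

sens, spec = sensitivity_specificity(toy_labels, toy_scores, threshold=0.5)
auc = roc_auc(toy_labels, toy_scores)
print(sens, spec, auc)
```

Sensitivity and specificity depend on the chosen score threshold, whereas the area under the ROC curve summarises performance across all thresholds, which is why the three values are typically reported together.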
Technology, Biochemistry & Molecular Biology, PREDICTION, Bioinformatics, Statistics & Probability, 46 Information and computing sciences, Biochemical Research Methods, Machine Learning, Amino Acid Sequence, 01 Mathematical Sciences, [INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM], Science & Technology, IDENTIFICATION, 31 Biological sciences, Proteins, 06 Biological Sciences, BOUNDARY, Biotechnology & Applied Microbiology, Tandem Repeat Sequences, Physical Sciences, Computer Science, Computer Science, Interdisciplinary Applications, Mathematical & Computational Biology, 08 Information and Computing Sciences, Life Sciences & Biomedicine, 49 Mathematical sciences, Mathematics, Algorithms, Software
| Indicator | Value | Description |
| --- | --- | --- |
| selected citations | 4 | These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). |
| popularity | Top 10% | Reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. |
| influence | Average | Reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). |
| impulse | Average | Reflects the initial momentum of an article directly after its publication, based on the underlying citation network. |
| views | 40 | |
| downloads | 2 | |

Views provided by UsageCounts