Downloads provided by UsageCounts
O-GlcNAcylation is an essential post-translational modification (PTM) in mammalian cells. It consists in the addition of a N-acetylglucosamine (GlcNAc) residue onto serines or threonines by an O-GlcNAc transferase (OGT). Inhibition of OGT is lethal, and misregulation of this PTM can lead to diverse pathologies including diabetes, Alzheimer's disease and cancers. Knowing the location of O-GlcNAcylation sites and the ability to accurately predict them is therefore of prime importance to a better understanding of this process and its related pathologies.Here, we present an evaluation of the current predictors of O-GlcNAcylation sites based on a newly built dataset and an investigation to improve predictions.Several datasets of experimentally proven O-GlcNAcylated sites were combined, and the resulting meta-dataset was used to evaluate three prediction tools. We further defined a set of new features following the analysis of the primary to tertiary structures of experimentally proven O-GlcNAcylated sites in order to improve predictions by the use of different types of machine learning techniques.Our results show the failure of currently available algorithms to predict O-GlcNAcylated sites with a precision exceeding 9%. Our efforts to improve the precision with new features using machine learning techniques do succeed for equal proportions of O-GlcNAcylated and non-O-GlcNAcylated sites but fail like the other tools for real-life proportions where ~1.4% of S/T are O-GlcNAcylated.Present-day algorithms for O-GlcNAcylation prediction narrowly outperform random prediction. The inclusion of additional features, in combination with machine learning algorithms, does not enhance these predictions, emphasizing a pressing need for further development. We hypothesize that the improvement of prediction algorithms requires characterization of OGT's partners.
570, glycosylation, Molecular Biology/Biochemistry [q-bio.BM], [SDV.BC.IC] Life Sciences [q-bio]/Cellular Biology/Cell Behavior [q-bio.CB], [SDV.BBM.BM] Life Sciences [q-bio]/Biochemistry, Molecular Biology/Molecular biology, [SDV.BDD] Life Sciences [q-bio]/Development Biology, [SDV.BC.BC] Life Sciences [q-bio]/Cellular Biology/Subcellular Processes [q-bio.SC], dataset, [SDV.BV] Life Sciences [q-bio]/Vegetal Biology, [SDV.BC] Life Sciences [q-bio]/Cellular Biology, [SDV.BBM.BC] Life Sciences [q-bio]/Biochemistry, Molecular Biology/Biochemistry [q-bio.BM], [SDV.BV.PEP] Life Sciences [q-bio]/Vegetal Biology/Phytopathology and phytopharmacy, Original Research, machine learning;glycosylation;O-GlcNAc;post-translational modification;dataset;OGT, [SDV.BIBS] Life Sciences [q-bio]/Quantitative Methods [q-bio.QM], [SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM], [SDV.BIO] Life Sciences [q-bio]/Biotechnology, machine learning, [SDV.BBM.BC]Life Sciences [q-bio]/Biochemistry, post-translational modification, Advances and Applications in Bioinformatics and Chemistry, OGT, O-GlcNAc, [SDV.BBM.GTP] Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN], [SDV.BV.AP] Life Sciences [q-bio]/Vegetal Biology/Plant breeding
570, glycosylation, Molecular Biology/Biochemistry [q-bio.BM], [SDV.BC.IC] Life Sciences [q-bio]/Cellular Biology/Cell Behavior [q-bio.CB], [SDV.BBM.BM] Life Sciences [q-bio]/Biochemistry, Molecular Biology/Molecular biology, [SDV.BDD] Life Sciences [q-bio]/Development Biology, [SDV.BC.BC] Life Sciences [q-bio]/Cellular Biology/Subcellular Processes [q-bio.SC], dataset, [SDV.BV] Life Sciences [q-bio]/Vegetal Biology, [SDV.BC] Life Sciences [q-bio]/Cellular Biology, [SDV.BBM.BC] Life Sciences [q-bio]/Biochemistry, Molecular Biology/Biochemistry [q-bio.BM], [SDV.BV.PEP] Life Sciences [q-bio]/Vegetal Biology/Phytopathology and phytopharmacy, Original Research, machine learning;glycosylation;O-GlcNAc;post-translational modification;dataset;OGT, [SDV.BIBS] Life Sciences [q-bio]/Quantitative Methods [q-bio.QM], [SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM], [SDV.BIO] Life Sciences [q-bio]/Biotechnology, machine learning, [SDV.BBM.BC]Life Sciences [q-bio]/Biochemistry, post-translational modification, Advances and Applications in Bioinformatics and Chemistry, OGT, O-GlcNAc, [SDV.BBM.GTP] Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN], [SDV.BV.AP] Life Sciences [q-bio]/Vegetal Biology/Plant breeding
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 5 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
| views | 22 | |
| downloads | 14 |

Views provided by UsageCounts
Downloads provided by UsageCounts