
We present novel kernels that measure similarity of two RNA sequences, taking account of their secondary structures. Two types of kernels are presented. One is for RNA sequences with known secondary structures, the other for those without known secondary structures. The latter employs stochastic context-free grammar (SCFG) for estimating the secondary structure. We call the latter the marginalized count kernel (MCK). We show computational experiments for MCK using 74 sets of human tRNA sequence data: (i) kernel principal component analysis (PCA) for visualizing tRNA similarities, (ii) supervised classification with support vector machines (SVMs). Both types of experiment show promising results for MCKs.
feature vector, SCFG, Sequence Analysis, RNA, count kernel, SVM, kernel PCA, Computational Biology, marginalized count kernel, RNA, Transfer, ROC Curve, kernel, Data Interpretation, Statistical, Humans, Nucleic Acid Conformation
feature vector, SCFG, Sequence Analysis, RNA, count kernel, SVM, kernel PCA, Computational Biology, marginalized count kernel, RNA, Transfer, ROC Curve, kernel, Data Interpretation, Statistical, Humans, Nucleic Acid Conformation
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 10 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
