Downloads provided by UsageCounts
This Zenodo entry contains the result of the feature selection algorithm implemented through a backward variable elimination strategy and in chopin2 applied on MetaPhlAn3 microbial profiles of a public dataset of metagenomic stool samples collected from patients affected by the colorectal cancer (CRC) as well as from healthy individuals. Microbial profiles have been extracted through the curatedMetagenomicData package for R under the IDs ThomasAM_2018a, ThomasAM_2018b, and ThomasAM_2019_a. The feature selection algorithm is implemented as a backward variable elimination method, and it makes use of the vector-symbolic architecture described in Cumbo F 2020. Deposited data is described below: datasets.tar.gz: it contains the datasets used as input of chopin2 as the result of merging the three datasets with relative abundances mentioned above, also stratified by age and sex (with prefix RA). The same datasets have been also binarized (with prefix BIN); hd-models.tar.gz: it contains the output of the feature selection performed with chopin2 on the datasets with both relative abundance and binary profiles (RA and BIN); ml-models.tar.gz: it contains the result of the feature selection produced with classical wrapper-based techniques (i.e., Random Forest, Decision Tree, Support Vector Machine, Logistic Regression, and Extreme Gradient Boosting) in addition to a Python 3.8 script to reproduce the results. Please note that the datasets RA__ThomasAM__species.csv and BIN__ThomasAM__species.csv are also included into the datasets.tar.gz archive.
{"references": ["Cumbo, Fabio at al. (2020). \"A brain-inspired hyperdimensional computing approach for classifying massive DNA methylation data of cancer\". MDPI Algorithms 2020"]}
hyperdimensional computing, metagenomics, vector-symbolic architecture, machine learning, feature selection, colorectal cancer, chopin2
hyperdimensional computing, metagenomics, vector-symbolic architecture, machine learning, feature selection, colorectal cancer, chopin2
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 76 | |
| downloads | 76 |

Views provided by UsageCounts
Downloads provided by UsageCounts