
Gene expression measurements represent the most important source of biological data used to unveil the interaction and functionality of genes. In this regard, several data mining and machine learning algorithms have been proposed that require, in a number of cases, some kind of data discretization to perform the inference. Selection of an appropriate discretization process has a major impact on the design and outcome of the inference algorithms, as there are a number of relevant issues that need to be considered. This study presents a revision of the current state-of-the-art discretization techniques, together with the key subjects that need to be considered when designing or selecting a discretization approach for gene expression data.
Gene Expression Profiling, Gene Expression, Machine Learning, Data Preprocessing, Data Mining, https://purl.org/becyt/ford/1.2, Gene Expressiion Analysis, Gene Expression Data, https://purl.org/becyt/ford/1, Algorithms, Discretization
Gene Expression Profiling, Gene Expression, Machine Learning, Data Preprocessing, Data Mining, https://purl.org/becyt/ford/1.2, Gene Expressiion Analysis, Gene Expression Data, https://purl.org/becyt/ford/1, Algorithms, Discretization
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 54 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
