
doi: 10.3390/a14020058
handle: 11367/102274
Data streams are ubiquitous and related to the proliferation of low-cost mobile devices, sensors, wireless networks and the Internet of Things. While it is well known that complex phenomena are not stationary and exhibit a concept drift when observed for a sufficiently long time, relatively few studies have addressed the related problem of feature drift. In this paper, a variation of the QuickReduct algorithm suitable to process data streams is proposed and tested: it builds an evolving reduct that dynamically selects the relevant features in the stream, removing the redundant ones and adding the newly relevant ones as soon as they become such. Tests on five publicly available datasets with an artificially injected drift have confirmed the effectiveness of the proposed method.
granulation, feature selection, Industrial engineering. Management engineering, Electronic computers. Computer science, QuickReduct, Concept drift; Feature drift; Feature selection; Granulation; Quickreduct; Rough set theory, feature drift, QA75.5-76.95, T55.4-60.8, concept drift, rough set theory
granulation, feature selection, Industrial engineering. Management engineering, Electronic computers. Computer science, QuickReduct, Concept drift; Feature drift; Feature selection; Granulation; Quickreduct; Rough set theory, feature drift, QA75.5-76.95, T55.4-60.8, concept drift, rough set theory
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 6 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
