<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>

COPY SCRIPT

For further information contact us at helpdesk@openaire.eu

streamwise feature selection

Name: streamwise feature selection
Keywords: 519, feature selection, Classification and discrimination; cluster analysis (statistical aspects), classification, multiple regression, Learning and adaptive systems in artificial intelligence, stepwise regression, false discovery rate

Streamwise feature selection

descriptionPublicationkeyboard_double_arrow_right Article 01 Dec 2006 United States Publisher:Microtome Publishing, Brookline, MAJournal:Journal of Machine Learning Research, volume 7, pages 1,861-1,885 (issn: 1532-4435,

Authors: Zhou, Jing; Stine, Robert A; Foster, Dean P; Ungar, Lyle H.;

doi: 10.5555/1248547.1248614

handle: 20.500.14332/6376

streamwise feature selection

- Summary
- Subjects
- Metrics

Abstract

Summary: In streamwise feature selection, new features are sequentially considered for addition to a predictive model. When the space of potential features is large, streamwise feature selection offers many advantages over traditional feature selection methods, which assume that all features are known in advance. Features can be generated dynamically, focusing the search for new features on promising subspaces, and overfitting can be controlled by dynamically adjusting the threshold for adding features to the model. In contrast to traditional forward feature selection algorithms, such as stepwise regression, in which at each step all possible features are evaluated and the best one is selected, streamwise feature selection only evaluates each feature once when it is generated. We describe information-investing and \(\alpha\)-investing, two adaptive complexity penalty methods for streamwise feature selection which dynamically adjust the threshold on the error reduction required for adding a new feature. These two methods give false discovery rate style guarantees against overfitting. They differ from standard penalty methods such as AIC, BIC and RIC, which always drastically over- or under-fit in the limit of infinite numbers of non-predictive features. Empirical results show that streamwise regression is competitive with (on small data sets) and superior to (on large data sets) much more compute-intensive feature selection methods such as stepwise regression, and allows feature selection on problems with millions of potential features.

Country

United States

Related Organizations

University of Pennsylvania
United States

Keywords

519, feature selection, Classification and discrimination; cluster analysis (statistical aspects), classification, multiple regression, Learning and adaptive systems in artificial intelligence, stepwise regression, false discovery rate

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

gold