
Sparsity of a classifier is a desirable property for high-dimensional data and large sample sizes. This paper investigates two complementary notions of sparsity for binary classification: sparsity in the number of features and sparsity in the number of examples. Several losses and regularizers are considered: the hinge loss and ramp loss, and l2, l1, approximate l0, and capped l1 regularization. We propose three new objective functions that further promote sparsity: capped l1 regularization with the hinge loss, and the ramp-loss versions of approximate l0 and capped l1 regularization. We derive difference of convex functions algorithms (DCA) for solving these novel non-convex objective functions. The proposed algorithms are shown to converge to a local minimum in a finite number of iterations. Using simulated data and several data sets from the University of California, Irvine (UCI) machine learning repository, we empirically investigate the fraction of features and examples required by the different classifiers.
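As a point of reference, one common way to write the losses and regularizers named above, and the generic DCA iteration, is sketched below; the notation (margin variable z = y f(x), cap parameter a, approximation parameter alpha) is illustrative and may differ from the exact parameterization used in the paper.

\[
\ell_{\text{hinge}}(z) = \max(0,\, 1 - z), \qquad
\ell_{\text{ramp}}(z) = \min\!\bigl(1,\, \max(0,\, 1 - z)\bigr), \qquad z = y\, f(x),
\]
\[
\Omega_{\ell_1}(w) = \sum_j |w_j|, \qquad
\Omega_{\text{cap}}(w) = \sum_j \min(|w_j|,\, a), \qquad
\Omega_{\approx \ell_0}(w) = \sum_j \bigl(1 - e^{-\alpha |w_j|}\bigr).
\]
The ramp loss truncates the hinge loss so that badly misclassified points contribute only a bounded penalty, which promotes sparsity in the number of examples that influence the classifier; the capped l1 and approximate l0 terms likewise bound the penalty on individual weights, promoting feature sparsity. Each of these non-convex objectives can be written as a difference of convex functions, F(w) = G(w) - H(w) with G and H convex (for instance, min(|w_j|, a) = |w_j| - max(|w_j| - a, 0)), and the generic DCA step linearizes the concave part at the current iterate:
\[
w^{(t+1)} \in \arg\min_{w} \; G(w) - \bigl\langle \xi^{(t)},\, w \bigr\rangle,
\qquad \xi^{(t)} \in \partial H\bigl(w^{(t)}\bigr).
\]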
Computer Science [cs]
