Online Learning of Noisy Data

descriptionPublicationkeyboard_double_arrow_right Article 01 Dec 2011 Italy Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Information Theory, volume 57, pages 7,907-7,931 (issn: 0018-9448,

Copyright policy )Funded by:EC | PASCAL2

Authors: N. Cesa-Bianchi; S. Shalev Shwartz; O. Shamir;

doi: 10.1109/tit.2011.2164053

handle: 2434/223626

Online Learning of Noisy Data

- Summary
- Metrics

Abstract

We study online learning of linear and kernel-based predictors, when individual examples are corrupted by random noise, and both examples and noise type can be chosen adversarially and change over time. We begin with the setting where some auxiliary information on the noise distribution is provided, and we wish to learn predictors with respect to the squared loss. Depending on the auxiliary information, we show how one can learn linear and kernel-based predictors, using just 1 or 2 noisy copies of each example. We then turn to discuss a general setting where virtually nothing is known about the noise distribution, and one wishes to learn with respect to general losses and using linear and kernel-based predictors. We show how this can be achieved using a random, essentially constant number of noisy copies of each example. Allowing multiple copies cannot be avoided: Indeed, we show that the setting becomes impossible when only one noisy copy of each instance can be accessed. To obtain our results we introduce several novel techniques, some of which might be of independent interest.

Country

Italy

Related Organizations

Hebrew University of Jerusalem
Israel
Microsoft Research New England (United States)
United States
Microsoft (United States)
United States
Microsoft (United Kingdom)
United Kingdom
University of Milan
Italy

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	28
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%