Online Learning with Kernels

descriptionPublicationkeyboard_double_arrow_right Article , Part of book or chapter of book , Conference object 08 Nov 2002 Australia English Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Signal Processing, volume 52, pages 2,165-2,176 (issn: 1053-587X,

Copyright policy )

Authors: Kivinen, Jyrki; Smola, Alexander; Williamson, Robert;

doi: 10.1109/tsp.2004.830991 , 10.7551/mitpress/1120.003.0105

handle: 1885/80760

Online Learning with Kernels

- Summary
- Subjects
- Metrics

Abstract

Kernel-based algorithms such as support vector machines have achieved considerable success in various problems in batch setting, where all of the training data is available in advance. Support vector machines combine the so-called kernel trick with the large margin idea. There has been little use of these methods in an online setting suitable for real-time applications. In this paper, we consider online learning in a reproducing kernel Hilbert space. By considering classical stochastic gradient descent within a feature space and the use of some straightforward tricks, we develop simple and computationally efficient algorithms for a wide range of problems such as classification, regression, and novelty detection. In addition to allowing the exploitation of the kernel trick in an online setting, we examine the value of large margins for classification in the online setting with a drifting target. We derive worst-case loss bounds, and moreover, we show the convergence of the hypothesis to the minimizer of the regularized risk functional. We present some experimental results that support the theory as well as illustrating the power of the new algorithms for online novelty detection.

Country

Australia

Related Organizations

Australian National University
Australia
University of Helsinki
Finland

Keywords

Optimization, Keywords: Computational methods, Learning systems, Random processes, Online learn, Kernel based algorithms, 510, Large margin classifiers, Functions, Convergence of numerical methods, Novelty detection, Regression analysis, Theorem proving, Neural networks

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	691
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 0.1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 0.1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%