Kernel Manifold Alignment for Domain Adaptation

descriptionPublicationkeyboard_double_arrow_right Article 12 Feb 2016Embargo end date: 03 Nov 2016 Switzerland English Publisher:Public Library of Science (PLoS)Journal:PLOS ONE, volume 11, page e0148655 (eissn: 1932-6203,

Copyright policy )Funded by:SNSF | Multimodal machine learni..., EC | SEDAL

Authors: Tuia, Devis; Camps-Valls, Gustau;

APC: 1,416.84 EUR

doi: 10.1371/journal.pone.0148655 , 10.5167/uzh-127043 , 10.5281/zenodo.4463170 , 10.5281/zenodo.4463171

pmid: 26872269

pmc: PMC4752280

Kernel Manifold Alignment for Domain Adaptation

- Summary
- Subjects
- Metrics

Abstract

The wealth of sensory data coming from different modalities has opened numerous opportunities for data analysis. The data are of increasing volume, complexity and dimensionality, thus calling for new methodological innovations towards multimodal data processing. However, multimodal architectures must rely on models able to adapt to changes in the data distribution. Differences in the density functions can be due to changes in acquisition conditions (pose, illumination), sensors characteristics (number of channels, resolution) or different views (e.g. street level vs. aerial views of a same building). We call these different acquisition modes domains, and refer to the adaptation problem as domain adaptation. In this paper, instead of adapting the trained models themselves, we alternatively focus on finding mappings of the data sources into a common, semantically meaningful, representation domain. This field of manifold alignment extends traditional techniques in statistics such as canonical correlation analysis (CCA) to deal with nonlinear adaptation and possibly non-corresponding data pairs between the domains. We introduce a kernel method for manifold alignment (KEMA) that can match an arbitrary number of data sources without needing corresponding pairs, just few labeled examples in all domains. KEMA has interesting properties: 1) it generalizes other manifold alignment methods, 2) it can align manifolds of very different complexities, performing a discriminative alignment preserving each manifold inner structure, 3) it can define a domain-specific metric to cope with multimodal specificities, 4) it can align data spaces of different dimensionality, 5) it is robust to strong nonlinear feature deformations, and 6) it is closed-form invertible, which allows transfer across domains and data synthesis. To authors’ knowledge this is the first method addressing all these important issues at once. We also present a reduced-rank version of KEMA for computational efficiency, and discuss the generalization performance of KEMA under Rademacher principles of stability. Aligning multimodal data with KEMA reports outstanding benefits when used as a data pre-conditioner step in the standard data analysis processing chain. KEMA exhibits very good performance over competing methods in synthetic controlled examples, visual object recognition and recognition of facial expressions tasks. KEMA is especially well-suited to deal with high-dimensional problems, such as images and videos, and under complicated distortions, twists and warpings of the data manifolds. A fully functional toolbox is available at https://github.com/dtuia/KEMA.git.

Country

Switzerland

Related Organizations

University of Valencia
Spain
University of Zurich
Switzerland
Jiangnan University
China (People's Republic of)
Xiangnan University
China (People's Republic of)
Swiss National Science Foundation
Switzerland

View all View all

Keywords

1000 Multidisciplinary, Science, Q, R, Information Storage and Retrieval, 1100 General Agricultural and Biological Sciences, 910 Geography & travel, Pattern Recognition, Automated, Facial Expression, 10122 Institute of Geography, 1300 General Biochemistry, Genetics and Molecular Biology, Data Interpretation, Statistical, Image Interpretation, Computer-Assisted, Medicine, Data Mining, Humans, 910 Geography & travel, Algorithms, Research Article

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	70
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%