Error-robust multi-view clustering

Publication: Article, Preprint, Conference object
Date: 01 Dec 2017
Embargo end date: 01 Jan 2018
Publisher: IEEE
Journal: 2017 IEEE International Conference on Big Data (Big Data)

Authors: Mehrnaz Najafi; Lifang He; Philip S. Yu

doi: 10.1109/bigdata.2017.8257989, 10.48550/arxiv.1801.00384

arXiv: 1801.00384

Abstract

In the era of big data, data may come from multiple sources; such data is known as multi-view data. Multi-view clustering aims to generate better clusters by exploiting complementary and consistent information from multiple views rather than relying on any individual view. Because of inevitable system errors caused by data-capturing sensors or other sources, the data in each view may be erroneous. Different types of errors behave differently and inconsistently in each view; in practice, errors can appear as noise or as corruptions. Unfortunately, none of the existing multi-view clustering approaches handles all of these error types, so their clustering performance degrades dramatically. In this paper, we propose a novel Markov chain method for Error-Robust Multi-View Clustering (EMVC). By decomposing each view into a shared transition probability matrix and an error matrix, and by imposing structured sparsity-inducing norms on the error matrices, we characterize and handle typical types of errors explicitly. To solve the challenging optimization problem, we propose a new efficient algorithm based on Augmented Lagrangian Multipliers and rigorously prove its convergence. Experimental results on various synthetic and real-world datasets show the superiority of the proposed EMVC method over the baseline methods and its robustness against different types of errors.
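The abstract does not give the exact EMVC objective, so the following is only a rough sketch of the generic building blocks such a method might rely on, not the authors' algorithm. It assumes a Gaussian-kernel affinity for building a per-view transition probability matrix, and it uses the entry-wise L1 norm and the column-wise L2,1 norm as two common examples of sparsity-inducing norms whose proximal operators typically appear inside Augmented Lagrangian iterations. The function names `transition_matrix`, `soft_threshold`, and `column_shrink` are hypothetical and chosen for illustration.

```python
import math


def transition_matrix(points, sigma=1.0):
    """Row-stochastic matrix from a Gaussian-kernel affinity over one view.

    The kernel choice and the bandwidth sigma are assumptions of this sketch.
    """
    n = len(points)
    rows = []
    for i in range(n):
        weights = []
        for j in range(n):
            sq_dist = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
            weights.append(math.exp(-sq_dist / (2 * sigma ** 2)))
        total = sum(weights)  # always > 0 because the diagonal weight is 1
        rows.append([w / total for w in weights])
    return rows


def soft_threshold(matrix, tau):
    """Proximal operator of tau * ||E||_1: shrink every entry toward zero."""
    return [
        [math.copysign(max(abs(x) - tau, 0.0), x) for x in row]
        for row in matrix
    ]


def column_shrink(matrix, tau):
    """Proximal operator of tau * ||E||_{2,1} with columns as groups.

    Each column is scaled by max(0, 1 - tau / norm), so columns with a
    small Euclidean norm are set to zero as a whole.
    """
    n_rows, n_cols = len(matrix), len(matrix[0])
    out = [[0.0] * n_cols for _ in range(n_rows)]
    for j in range(n_cols):
        norm = math.sqrt(sum(matrix[i][j] ** 2 for i in range(n_rows)))
        if norm > tau:
            scale = 1.0 - tau / norm
            for i in range(n_rows):
                out[i][j] = scale * matrix[i][j]
    return out


# Toy usage: one view with three samples, and a small made-up error matrix.
view = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0]]
P = transition_matrix(view)          # each row of P sums to 1
E = [[3.0, 0.1], [4.0, -0.1]]
E_sparse = soft_threshold(E, 0.5)    # small entries are zeroed individually
E_columns = column_shrink(E, 1.0)    # the low-energy second column is zeroed
```

The entry-wise operator suits errors scattered across individual entries, while the column-wise operator suits errors concentrated in a few samples or features; which norm matches which error type in EMVC is specified in the paper itself, not here.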

10 pages, 2017 IEEE International Conference on Big Data (Big Data 2017)

Related Organizations

Cornell University, United States
University of Illinois at Chicago, United States
University of Chicago, United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Machine Learning (cs.LG)

Impact by BIP!

Selected citations: 11. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Green

Fields of Science

natural sciences
