
Despite significant progress, there remain three limitations to the previous multi-view clustering algorithms. First, they often suffer from high computational complexity, restricting their feasibility for large-scale datasets. Second, they typically fuse multi-view information via one-stage fusion, neglecting the possibilities in multi-stage fusions. Third, dataset-specific hyperparameter-tuning is frequently required, further undermining their practicability. In light of this, we propose a fast multi-view clustering via ensembles (FastMICE) approach. Particularly, the concept of random view groups is presented to capture the versatile view-wise relationships, through which the hybrid early-late fusion strategy is designed to enable efficient multi-stage fusions. With multiple views extended to many view groups, three levels of diversity (w.r.t. features, anchors, and neighbors, respectively) are jointly leveraged for constructing the view-sharing bipartite graphs in the early-stage fusion. Then, a set of diversified base clusterings for different view groups are obtained via fast graph partitioning, which are further formulated into a unified bipartite graph for final clustering in the late-stage fusion. Notably, FastMICE has almost linear time and space complexity, and is free of dataset-specific tuning. Experiments on 22 multi-view datasets demonstrate its advantages in scalability (for extremely large datasets), superiority (in clustering performance), and simplicity (to be applied) over the state-of-the-art. Code available: https://github.com/huangdonghere/FastMICE.
To appear in IEEE Transactions on Knowledge and Data Engineering
FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial intelligence, Spectral Clustering, Scalability, Machine Learning (stat.ML), Computer science, Visual Object Tracking and Person Re-identification, Machine Learning (cs.LG), Database, Cluster analysis, Image Feature Retrieval and Recognition Techniques, Statistics - Machine Learning, Computer Science, Physical Sciences, Multiple Object Tracking, Foreground Segmentation, Computer Vision and Pattern Recognition, Face Recognition and Dimensionality Reduction Techniques, Feature Matching, Dimensionality Reduction
FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial intelligence, Spectral Clustering, Scalability, Machine Learning (stat.ML), Computer science, Visual Object Tracking and Person Re-identification, Machine Learning (cs.LG), Database, Cluster analysis, Image Feature Retrieval and Recognition Techniques, Statistics - Machine Learning, Computer Science, Physical Sciences, Multiple Object Tracking, Foreground Segmentation, Computer Vision and Pattern Recognition, Face Recognition and Dimensionality Reduction Techniques, Feature Matching, Dimensionality Reduction
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 77 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
