Universal Algorithms for Clustering Problems

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 09 Mar 2023Embargo end date: 01 Jan 2021 Germany English Publisher:Association for Computing Machinery (ACM)Journal:ACM Transactions on Algorithms, volume 19, pages 1-46 (issn: 1549-6325, eissn: 1549-6333,

Copyright policy )Funded by:NSF | AitF: FULL: Collaborative..., NSF | CAREER: New Directions in..., NSF | AitF: Full: Collaborative... +1 projects

Authors: Arun Ganesh; Bruce M. Maggs; Debmalya Panigrahi;

doi: 10.1145/3572840 , 10.48550/arxiv.2105.02363

arXiv: 2105.02363

Universal Algorithms for Clustering Problems

- Summary
- Subjects
- Metrics

Abstract

This article presentsuniversalalgorithms for clustering problems, including the widely studiedk-median,k-means, andk-center objectives. The input is a metric space containing allpotentialclient locations. The algorithm must selectkcluster centers such that they are a good solution foranysubset of clients that actually realize. Specifically, we aim for lowregret, defined as the maximum over all subsets of the difference between the cost of the algorithm’s solution and that of an optimal solution. A universal algorithm’s solutionSolfor a clustering problem is said to be an α , β-approximation if for all subsets of clientsC′, it satisfiessol(C′) ≤ α ċopt(C′) + β ċmr, whereopt(C′ is the cost of the optimal solution for clients (C′) andmris the minimum regret achievable by any solution.Our main results are universal algorithms for the standard clustering objectives ofk-median,k-means, andk-center that achieve (O(1),O(1))-approximations. These results are obtained via a novel framework for universal algorithms using linear programming (LP) relaxations. These results generalize to other ℓp-objectives and the setting where some subset of the clients arefixed. We also give hardness results showing that (α, β)-approximation is NP-hard if α or β is at most a certain constant, even for the widely studied special case of Euclidean metric spaces. This shows that in some sense, (O(1),O(1))-approximation is the strongest type of guarantee obtainable for universal clustering.

Country

Germany

Related Organizations

Leibniz Association
Germany
University of California, Berkeley
United States
Duke University
United States
Schloss Dagstuhl – Leibniz Center for Informatics
Germany

Keywords

FOS: Computer and information sciences, universal algorithms, k-means, Computer science, 004, k-median, Computer Science - Data Structures and Algorithms, Data Structures and Algorithms (cs.DS), k-center, clustering, ddc: ddc:004

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%