Name: Sequential and Shared-Memory Parallel Algorithms for Partitioned Local Depths
Keywords: FOS: Computer and information sciences, Computer Science - Machine Learning, D.1.3, Computer Science - Distributed, Parallel, and Cluster Computing, Statistics - Machine Learning, Machine Learning (stat.ML), Distributed, Parallel, and Cluster Computing (cs.DC), 68W10, Machine Learning (cs.LG)

Type: Part of book or chapter of book, Article, Preprint
Publication date: 01 Jan 2024
Embargo end date: 01 Jan 2023
Language: English
Publisher: Society for Industrial & Applied Mathematics (SIAM)
Funded by: NSF | Collaborative Research: O...

Authors: Devarakonda, Aditya; Ballard, Grey

doi: 10.1137/1.9781611977967.5, 10.48550/arxiv.2307.16652

arXiv: 2307.16652

Abstract

In this work, we design, analyze, and optimize sequential and shared-memory parallel algorithms for partitioned local depths (PaLD). Given a set of data points and pairwise distances, PaLD is a method for identifying strength of pairwise relationships based on relative distances, enabling the identification of strong ties within dense and sparse communities even if their sizes and within-community absolute distances vary greatly. We design two algorithmic variants that perform community structure analysis through triplet comparisons of pairwise distances. We present theoretical analyses of computation and communication costs and prove that the sequential algorithms are communication optimal, up to constant factors. We introduce performance optimization strategies that yield sequential speedups of up to $29\times$ over a baseline sequential implementation and parallel speedups of up to $19.4\times$ over optimized sequential implementations using up to $32$ threads on an Intel multicore CPU.
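The abstract does not spell out the computation, so the following is only a minimal sketch of what a pairwise, triplet-comparison formulation of PaLD might look like. It assumes the cohesion definition from the original PaLD literature: for each pair (x, y), the local focus is every point z within distance d(x, y) of x or of y, and each such z lends support 1/|focus| to whichever of x or y it is closer to. The function name `pald_pairwise`, the even split on ties, and the dense list-of-lists layout are illustrative assumptions, not the authors' optimized implementation.

```python
def pald_pairwise(D):
    """Cohesion matrix from a symmetric distance matrix D (list of lists).

    Sketch only: unblocked, unoptimized, O(n^3) triplet comparisons.
    C[x][z] accumulates the support that point z lends to point x.
    """
    n = len(D)
    C = [[0.0] * n for _ in range(n)]
    for x in range(n):
        for y in range(x + 1, n):
            dxy = D[x][y]
            # Local focus of the pair: points within d(x, y) of x or of y.
            # It always contains x and y themselves.
            focus = [z for z in range(n) if D[x][z] <= dxy or D[y][z] <= dxy]
            w = 1.0 / len(focus)
            for z in focus:
                if D[x][z] < D[y][z]:
                    C[x][z] += w          # z is closer to x
                elif D[y][z] < D[x][z]:
                    C[y][z] += w          # z is closer to y
                else:
                    # Tie handling is an assumption: split the support evenly.
                    C[x][z] += 0.5 * w
                    C[y][z] += 0.5 * w
    # Average over the n - 1 possible partners of each point.
    scale = 1.0 / (n - 1)
    return [[c * scale for c in row] for row in C]


# Three points on a line at positions 0, 1 and 10: two close neighbours
# and one distant outlier.
D = [[0, 1, 10],
     [1, 0, 9],
     [10, 9, 0]]
C = pald_pairwise(D)
```

In this toy input the two nearby points support each other (`C[0][1]` and `C[1][0]` are positive) while the outlier exchanges no support with either, which illustrates how relative rather than absolute distances drive the result. Each pair distributes a total weight of 1, so the entries of `C` sum to n/2.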

Impact by BIP!

- selected citations: 0. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Funded by

NSF | Collaborative Research: OAC Core: Robust, Scalable, and Practical Low-Rank Approximation