Parallel graph algorithms in constant adaptive rounds

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Sep 2020Embargo end date: 01 Jan 2020 English Publisher:Association for Computing Machinery (ACM)Journal:Proceedings of the VLDB Endowment, volume 13, pages 3,588-3,602 (issn: 2150-8097,

Copyright policy )

Authors: Behnezhad, Soheil; Dhulipala, Laxman; Esfandiari, Hossein; Lacki, Jakub; Mirrokni, Vahab; Schudy, Warren;

doi: 10.14778/3424573.3424579 , 10.48550/arxiv.2009.11552

arXiv: 2009.11552

handle: 1721.1/136701

Parallel graph algorithms in constant adaptive rounds

- Summary
- Subjects
- Metrics

Abstract

We study fundamental graph problems such as graph connectivity, minimum spanning forest (MSF), and approximate maximum (weight) matching in a distributed setting. In particular, we focus on the Adaptive Massively Parallel Computation (AMPC) model, which is a theoretical model that captures MapReduce-like computation augmented with a distributed hash table. We show the first AMPC algorithms for all of the studied problems that run in a constant number of rounds and use only O ( n ϵ ) space per machine, where 0 < ϵ < 1. Our results improve both upon the previous results in the AMPC model, as well as the best-known results in the MPC model, which is the theoretical model underpinning many popular distributed computation frameworks, such as MapReduce, Hadoop, Beam, Pregel and Giraph. Finally, we provide an empirical comparison of the algorithms in the MPC and AMPC models in a fault-tolerant distributed computation environment. We empirically evaluate our algorithms on a set of large real-world graphs and show that our AMPC algorithms can achieve improvements in both running time and round-complexity over optimized MPC baselines.

Related Organizations

University of Maryland, College Park
United States
University of Maryland, College Park
United States
Google (Canada)
Canada
Google (United States)
United States
Massachusetts Institute of Technology
United States

Keywords

FOS: Computer and information sciences, Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Data Structures and Algorithms, Data Structures and Algorithms (cs.DS), Distributed, Parallel, and Cluster Computing (cs.DC)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	9
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%