A meta-graph approach to analyze subgraph-centric distributed programming models

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Dec 2016Embargo end date: 01 Jan 2015Publisher:IEEEJournal:2016 IEEE International Conference on Big Data (Big Data)

Authors: Dindokar, Ravikant; Choudhury, Neel; Simmhan, Yogesh;

doi: 10.1109/bigdata.2016.7840587 , 10.48550/arxiv.1508.04265

arXiv: 1508.04265

A meta-graph approach to analyze subgraph-centric distributed programming models

- Summary
- Subjects
- Related research
  (2)
- Metrics

Abstract

Component-centric distributed graph processing platforms that use a bulk synchronous parallel (BSP) programming model have gained traction. These address the short-comings of Big Data abstractions/platforms like MapReduce/Hadoop for large-scale graph processing. However, there is limited literature on foundational aspects of the behavior of these component-centric abstractions for different graphs, graph partitioning, and graph algorithms. Here, we propose a analytical approach based on a meta-graph sketch to examine the characteristics of component-centric graph programming models at a coarse granularity. In particular, we apply this sketch to subgraph- and block-centric abstractions, and draw a comparison with vertex-centric models like Google's Pregel. First, we explore the impact of various graph partitioning techniques on the meta-graph, and next consider the impact of the meta-graph on graph algorithms. This decouples the unwieldy large graph and their partitioning specific artifacts from their algorithmic analysis. We use 5 spatial and powerlaw graphs as exemplars, four different partitioning strategies, and PageRank and Breadth First Search as canonical algorithms. These analysis over the meta-graphs provide a reliable measure of the expected number of supersteps, and the communication and computational complexity of the algorithms for various graphs, and the relative merits of subgraph-centric models over vertex-centric ones.

Related Organizations

Indian Institute of Science Bangalore
India

Keywords

FOS: Computer and information sciences, Computer Science - Distributed, Parallel, and Cluster Computing, Distributed, Parallel, and Cluster Computing (cs.DC)

2 Research products, page 1 of 1

Component-Centric Reduced Order Modeling for the Prediction of the Nonlinear Geometric Response of a Part of a Stiffened Structure
2018IsAmongTopNSimilarDocuments
Characterization of Vertex-Centric Breadth First Search for Lattice Graphs
2017IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	6
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%