Benchmarking the graphulo processing framework

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object , Other literature type 01 Sep 2016Embargo end date: 01 Jan 2016Publisher:IEEEJournal:2016 IEEE High Performance Extreme Computing Conference (HPEC)Funded by:FCT | D4

Authors: Timothy Weale; Vijay Gadepally; Dylan Hutchison; Jeremy Kepner;

doi: 10.1109/hpec.2016.7761640 , 10.48550/arxiv.1609.08642

arXiv: 1609.08642

Benchmarking the graphulo processing framework

- Summary
- Subjects
- Metrics

Abstract

Graph algorithms have wide applicablity to a variety of domains and are often used on massive datasets. Recent standardization efforts such as the GraphBLAS specify a set of key computational kernels that hardware and software developers can adhere to. Graphulo is a processing framework that enables GraphBLAS kernels in the Apache Accumulo database. In our previous work, we have demonstrated a core Graphulo operation called \textit{TableMult} that performs large-scale multiplication operations of database tables. In this article, we present the results of scaling the Graphulo engine to larger problems and scalablity when a greater number of resources is used. Specifically, we present two experiments that demonstrate Graphulo scaling performance is linear with the number of available resources. The first experiment demonstrates cluster processing rates through Graphulo's TableMult operator on two large graphs, scaled between $2^{17}$ and $2^{19}$ vertices. The second experiment uses TableMult to extract a random set of rows from a large graph ($2^{19}$ nodes) to simulate a cued graph analytic. These benchmarking results are of relevance to Graphulo users who wish to apply Graphulo to their graph problems.

5 pages, 4 figures, IEEE High Performance Extreme Computing (HPEC) conference 2016

Related Organizations

Washington State University
United States
University of Washington
United States
University of Mary
United States
MIT Lincoln Laboratory
United States
UNIVERSITY OF WASHINGTON
United States

View all View all

Keywords

Performance (cs.PF), FOS: Computer and information sciences, Computer Science - Performance, Computer Science - Databases, Computer Science - Mathematical Software, Databases (cs.DB), Mathematical Software (cs.MS)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

3

Average

Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Funded by

FCT| D4

Related to Research communities

UArctic