descriptionPublicationkeyboard_double_arrow_right Article 01 Apr 2019Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, volume 38, pages 640-653 (issn: 0278-0070, eissn: 1937-4151,

Authors: Guohao Dai; Tianhao Huang; Yuze Chi; Jishen Zhao; Guangyu Sun; Yongpan Liu; Yu Wang; +2 Authors

doi: 10.1109/tcad.2018.2821565

GraphH: A Processing-in-Memory Architecture for Large-Scale Graph Processing

- Summary
- Subjects
- Related research
  (2)
- Metrics

Abstract

Large-scale graph processing requires the high bandwidth of data access. However, as graph computing continues to scale, it becomes increasingly challenging to achieve a high bandwidth on generic computing architectures. The primary reasons include: the random access pattern causing local bandwidth degradation, the poor locality leading to unpredictable global data access, heavy conflicts on updating the same vertex, and unbalanced workloads across processing units. Processing-in-memory (PIM) has been explored as a promising solution to providing high bandwidth, yet open questions of graph processing on PIM devices remain in: 1) how to design hardware specializations and the interconnection scheme to fully utilize bandwidth of PIM devices and ensure locality and 2) how to allocate data and schedule processing flow to avoid conflicts and balance workloads. In this paper, we propose GraphH, a PIM architecture for graph processing on the hybrid memory cube array, to tackle all four problems mentioned above. From the architecture perspective, we integrate SRAM-based on-chip vertex buffers to eliminate local bandwidth degradation. We also introduce reconfigurable double-mesh connection to provide high global bandwidth. From the algorithm perspective, partitioning and scheduling methods like index mapping interval-block and round interval pair are introduced to GraphH, thus workloads are balanced and conflicts are avoided. Two optimization methods are further introduced to reduce synchronization overhead and reuse on-chip data. The experimental results on graphs with billions of edges demonstrate that GraphH outperforms DDR-based graph processing systems by up to two orders of magnitude and $5.12 {\times }$ speedup against the previous PIM design.

Related Organizations

Peking University
China (People's Republic of)
University of California, Santa Barbara
United States
Peking University
China (People's Republic of)
Tsinghua University
China (People's Republic of)
University of California, Los Angeles
United States

View all View all

Keywords

Large-scale graph processing, Memory hierarchy, Hybrid memory cube (HMC), On-chip networks

2 Research products, page 1 of 1

GraphH: High Performance Big Graph Analytics in Small Clusters
2017IsAmongTopNSimilarDocuments
Strong products ofϰ-critical graphs
1993IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	80
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%