2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA) 2016
DOI: 10.1109/isca.2016.24
|View full text |Cite
|
Sign up to set email alerts
|

Energy Efficient Architecture for Graph Analytics Accelerators

Abstract: Specialized hardware accelerators can significantly improve the performance and power efficiency of compute systems. In this paper, we focus on hardware accelerators for graph analytics applications and propose a configurable architecture template that is specifically optimized for iterative vertex-centric graph applications with irregular access patterns and asymmetric convergence. The proposed architecture addresses the limitations of the existing multi-core CPU and GPU architectures for these types of appli… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 105 publications
(39 citation statements)
references
References 27 publications
0
39
0
Order By: Relevance
“…The area, power, and access latency of the on-chip scratchpad memory are estimated using Cacti 6.5 [1]. Since Cacti only supports down to 32 nm technologies, we apply four different scaling factors to convert them to 12 nm technology as shown in [33,36]. The energy of HBM 1.0 is estimated with 7 pJ/bit as in [32,41].…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The area, power, and access latency of the on-chip scratchpad memory are estimated using Cacti 6.5 [1]. Since Cacti only supports down to 32 nm technologies, we apply four different scaling factors to convert them to 12 nm technology as shown in [33,36]. The energy of HBM 1.0 is estimated with 7 pJ/bit as in [32,41].…”
Section: Methodsmentioning
confidence: 99%
“…GCNs demand specialized architecture design. With the emergence of graph analytics and neural networks workloads, a lot of hardware architecture designs are proposed to accelerate these workloads [7,8,17,22,33]. For example, Graphicionado [17] is tailored for graph analtyics; while TPU [22] focuses on the acceleration of neural networks.…”
Section: Related Workmentioning
confidence: 99%
“…If incoming edges are used in the edge array, this layout is called Compressed Sparse Column (CSC). The compressed adjacency list graph is relatively compact and beneficial to many graph accelerators [29,71] . Note that the edges of each vertex are stored sequentially.…”
Section: Graph Layout Reorganizationmentioning
confidence: 99%
“…Specifically in terms of graph processing, it has been also witnessed that a large number of relevant studies build their graph processing accelerators based on FPGA [24][25][26][27][28] and ASIC [16,[29][30][31] . Evaluation on these http://www.riscv.org, Jan. 2019.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation