2016
DOI: 10.1587/elex.13.20160527
|View full text |Cite
|
Sign up to set email alerts
|

Exploring new features of high-bandwidth memory for GPUs

Abstract: Due to the off-chip I/O pin and power constraints of GDDR5, HBM has been proposed to provide higher bandwidth and lower power consumption for GPUs. In this paper, we first provide detailed comparison between HBM and GDDR5 and expose two unique features of HBM: dualcommand and pseudo channel mode. Second, we analyze the effectiveness of these two features and show that neither notably contributes to performance. However, by combining pseudo channel mode with cache architecture supporting fine-grained cache-line… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 14 publications
0
7
0
Order By: Relevance
“…GEM-5 does not have an energy model for HBM. So we created a model based on DDR4 as described in [37]. The energy consumption of through-silicon vias (TSV) was not considered.…”
Section: B Energy Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…GEM-5 does not have an energy model for HBM. So we created a model based on DDR4 as described in [37]. The energy consumption of through-silicon vias (TSV) was not considered.…”
Section: B Energy Resultsmentioning
confidence: 99%
“…For the energy models, we employ the models integrated in GEM-5, where the energy model of HBM is not supported. Hence, we properly built the HBM energy models based on [37]. We simulated the performance and the energy of seven architectures: 1) DRAM normally refreshed, VOLUME 4, 2016 This work is licensed under a Creative Commons Attribution 4.0 License.…”
Section: Discussionmentioning
confidence: 99%
“…Some initial works present HBM as an emerging memory standard that can provide bandwidth superior to 256GB/s as well as offers lower power consumption [22]. More recent works compare HBM and DDR for high performance systems [4]. [18] presents the challenges of capacity scaling of HBM device by stacking more DRAM dies to make a taller stack.…”
Section: Synthetic Experimentsmentioning
confidence: 99%
“…Also, unlike the other variant of 3Dstacked DRAM -Hybrid Memory Cube (HMC), HBM is adopted as a JEDEC standard, implements much wider data bus and resides on the same silicon interposer as the processing unit. HBM, common in GPUs, FPGA-CPUs and System on Chips like the Xilinx UltraScale+, is better equipped to handle increased memory requirements of GPU and accelerator-based architectures [4], [5]. A recent study suggests the combination of low power consumption and high bandwidth make this category of memory ideal for embedded systems as well [6].…”
Section: Introductionmentioning
confidence: 99%
“…The diversity of high-speed I/O standards for DRAM requires an automated way of doing production tests over a variety of channels [1,2,3,4,5,6,7]. DRAM standards, such as DDR4, LPDDR4, GDDR5, and HBM address different application-specific needs in PCs, mobile devices, graphics processors, and artificial intelligence (AI) accelerators, respectively [8,9,10,11], and use different I/O channels whose loss characteristics span a wide variety, as shown in Fig.…”
Section: Introductionmentioning
confidence: 99%