2017
DOI: 10.1109/tvlsi.2016.2570283
|View full text |Cite
|
Sign up to set email alerts
|

Logic-Base Interconnect Design for Near Memory Computing in the Smart Memory Cube

Abstract: Hybrid memory cube (HMC) has promised to improve bandwidth, power consumption, and density for the next-generation main memory systems. In addition, 3-D integration gives a second shot for revisiting near memory computation to fill the gap between processors and memories. In this paper, we study the required infrastructure inside the HMC to support near memory computation in a modular and flexible fashion. We propose a fully backward compatible extension to the standard HMC called the smart memory cube, and de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
26
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 31 publications
(26 citation statements)
references
References 42 publications
0
26
0
Order By: Relevance
“…This configuration is found to achieve reasonable performance and efficiency through several simulations. Total available DRAM is 1GB in 4 stacked dies with DRAM banks of 32MB and a closed-page policy [25]. Low-interleavedaddressing is implemented as the HMC's default addressing scheme [26].…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…This configuration is found to achieve reasonable performance and efficiency through several simulations. Total available DRAM is 1GB in 4 stacked dies with DRAM banks of 32MB and a closed-page policy [25]. Low-interleavedaddressing is implemented as the HMC's default addressing scheme [26].…”
Section: Resultsmentioning
confidence: 99%
“…A fully functional and cycle-accurate (CA) RTL model of the NeuroCluster has been modeled in SystemVerilog, with the components adopted and reconfigured from [49]. This model along with a previously developed cycle-accurate model of the SMC [25] allows us to analyze the performance of tiled execution over a single SMC device considering the programming overheads.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In another work, Gao et al [20] developed hardware and software of a 3D stack memory and neardata processing architecture for in-memory analytics frameworks, including MapReduce, graph processing, and deep neural networks. Azarkhish et al [5] developed Smart Memory Cube and designed a high bandwidth interconnect to serve the bandwidth demand of PIM architecture. Zhang et al [49] explored PIM implemented via 3D die stacking.…”
Section: D Processing-in-memory Architecturesmentioning
confidence: 99%