2016
DOI: 10.1007/s11390-016-1610-1
|View full text |Cite
|
Sign up to set email alerts
|

Performance-Centric Optimization for Racetrack Memory Based Register File on GPUs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 19 publications
0
6
0
Order By: Relevance
“…This vertical arrangement is considered as a breakthrough, where a forest of such nanowires is arranged, which tremendously increases the storage capacity in RM. Such an arrangement of RM with large data storage capacity and minimum footprint is advocated to replace the current memory structures at different levels such as SRAM and DRAM cache [220,[227][228][229][230][231][232], GPU register [233][234][235], and off-chip (stand-alone) memory [236] owing to its non-volatility, high read/write speed, and lower read/write energy. Diligent efforts needed to investigate the feasibility of replacing the main memory by RM.…”
Section: Magnetic Domain Wall Nanowirementioning
confidence: 99%
“…This vertical arrangement is considered as a breakthrough, where a forest of such nanowires is arranged, which tremendously increases the storage capacity in RM. Such an arrangement of RM with large data storage capacity and minimum footprint is advocated to replace the current memory structures at different levels such as SRAM and DRAM cache [220,[227][228][229][230][231][232], GPU register [233][234][235], and off-chip (stand-alone) memory [236] owing to its non-volatility, high read/write speed, and lower read/write energy. Diligent efforts needed to investigate the feasibility of replacing the main memory by RM.…”
Section: Magnetic Domain Wall Nanowirementioning
confidence: 99%
“…The immense storage requirement of GPU applications makes RTMs a preferable alternative to be employed as a GPU register file. To this end, various proposals propose RTM-based GPU register files to alleviate the high leakage and scalability problems of conventional SRAMbased register files [86]- [89]. These proposals are based on the tenet of reducing the shift overhead via different techniques that include smart register renaming [86], [87], [89], proactive preshifting [87]- [90], and intelligent thread scheduling [86].…”
Section: B Rtm Gpu Register Filementioning
confidence: 99%
“…line 18), when the vertex-to-group weight of the selected vertex is equal with both sub-groups, the algorithm compares the edge weight of the selected vertex v * with the last vertices of both groups (v p in д r and v q in д l ) and favors the maximum edge weight (cf. lines [19][20][21][22][23][24].…”
Section: The Shiftsreduce Heuristicmentioning
confidence: 99%
“…While most of these solutions effectively improve both performance and energy, their applicability to RMs is of secondary interests (hybrid RM-S/DRAM memory system). Fundamentally, the data-placement solutions in RMs such as for GPU register files [24], scratchpad memories [13,29], and stacks [14] aim at reducing the number of RM shifts.…”
Section: Related Workmentioning
confidence: 99%