Energy/power breakdown of pipelined nanometer caches (90nm/65nm/45nm/32nm)

Rodríguez, Samuel; Jacob, Bruce

doi:10.1145/1165573.1165581

Cited by 35 publications

(17 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In active mode, with the use of above mentioned techniques, the leakage power can be safely reduced to less than 20%. Recent product datasheets and publications [16], [3] also substantiate that leakage ratio can be assumed to be 20% in 65nm technology. When the processor is clock gated, apart from leakage, only the PLL is active.…”

Section: Alpha 21264 Power Model In 65nmmentioning

confidence: 93%

Clock gate on abort: Towards energy-efficient hardware Transactional Memory

Sanyal

Roy²,

Cristal

et al. 2009

2009 IEEE International Symposium on Parallel &Amp; Distributed Processing

View full text Add to dashboard Cite

Abstract-Transactional Memory (TM) is an emerging technology which promises to make parallel programming easier compared to earlier lock based approaches. However, as with any form of speculation, Transactional Memory too wastes a considerable amount of energy when the speculation goes wrong and transaction aborts. For Transactional Memory this wastage will typically be quite high because programmer will often mark a large portion of the code to be executed transactionally [4].We are proposing to turn-off a processor dynamically by gating all its clocks, whenever any transaction running in it is aborted. We have described a novel protocol which can be used in the Scalable-TCC like Hardware Transactional Memory systems. Also in the protocol we are proposing a gating-aware contention management policy to set the duration of the clock gating period precisely so that both performance and energy can be improved.With our proposal we got an average 19% savings in the total consumed energy and even an average speed-up of 4%.

show abstract

Section: Alpha 21264 Power Model In 65nmmentioning

confidence: 93%

Clock gate on abort: Towards energy-efficient hardware Transactional Memory

Sanyal

Roy²,

Cristal

et al. 2009

2009 IEEE International Symposium on Parallel &Amp; Distributed Processing

View full text Add to dashboard Cite

show abstract

“…We focus on I off as the primary contributor to leakage in nanometer caches [1]. Figure 1 shows a 6-transistor SRAM cell storing a 1 logic value.…”

Section: Value-dependence Of Sram Leakagementioning

confidence: 99%

“…A main cause behind such variations is the random dopant fluctuation (RDF) which differently affects even adjacent transistors and changes their V th . Since subthreshold leakage (I off , which is the dominant source of leakage in SRAM memories in nanometer technologies [1]) exponentially depends on V th , much bigger variations are observed in I off following the variations in V th . In the conventional 6-transistor SRAM cells, three of the transistors contribute to subthreshold leakage when storing a 1, and the other three contribute to leakage when storing a 0.…”

Section: Introductionmentioning

confidence: 99%

“…by reordering code blocks in a scratchpad memory), and/or (ii) by invalidating and refilling highly leaky SRAM cells with the less leaky value when going to standby mode. Since share of leakage in total power consumption of SRAM-based memories is increasing with technology scaling [1,2] and SRAM-based memories comprise the largest component of processor-based embedded systems (e.g. 70% of StrongARM [3]) and are among main sources of power consumption there [1], finding new opportunities for leakage reduction is important.…”

Section: Introductionmentioning

confidence: 99%

“…Since share of leakage in total power consumption of SRAM-based memories is increasing with technology scaling [1,2] and SRAM-based memories comprise the largest component of processor-based embedded systems (e.g. 70% of StrongARM [3]) and are among main sources of power consumption there [1], finding new opportunities for leakage reduction is important. Especially note that SRAM value-control can be done at system and software level without needing expensive changes at the circuit-level or manufacturing process.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Value-dependence of SRAM leakage in deca-nanometer technologies

Goudarzi

Ishihara

2008

IEICE Electron. Express

View full text Add to dashboard Cite

Within-die process variation increases with technology scaling in nanometer era. Due to uncorrelated random variations in the threshold voltage (V th ), neighboring transistors in a 6-T SRAM have different V th and dissipate different subthreshold leakages. Since 3 transistors leak when the cell stores a 1 and the other 3 leak when it stores a 0, total cell leakage depends on its stored value. Using Monte Carlo simulations, we show that this difference averages 46% at a variation of 58% in V th . This phenomenon can be used to reduce leakage of SRAM-based memories by value control.

show abstract

Influence of Stacked 3D Memory/Cache Architectures on GPUs

Maashri

Sun²,

Dong³

et al. 2010

Integrated Circuits and Systems

View full text Add to dashboard Cite

Graphics Processing Units (GPUs) are highly parallel processing units that offload graphics rendering from microprocessors. For over 20 years, these units were exclusively utilized for graphics processing and over that period, more and more technological breakthroughs have been accomplished in improving the computational power of GPUs. One of the latest advances is the ability to program the GPU pipeline, allowing non-graphics applications and algorithms to run on top of the GPU. This has opened a whole new research area that is concerned with utilizing GPUs for running general purpose applications. Consequently, this has put more pressure on manufacturers in seeking innovative ways of improving GPUs even more. For instance, 3D die-stacking-another emerging technology-can be considered in improving GPU's performance.In this chapter, we discuss how 3D technology can be implemented in GPUs. We also investigate the problems and constraints of implementing such a technology and propose and assess solutions to these problems. Moreover, we propose architectural designs for the GPU that implements 3D technology and evaluate these designs in terms of cost, power consumption and thermal profile.However, before we delve into that discussion, it is very important for the reader to understand how GPU works and what its architecture looks like. Therefore, the next section introduces GPU technology and its hardware architecture. Also, we shortly discuss the 3D technology and the benefits that it offers for this demanding application.

show abstract

Energy/power breakdown of pipelined nanometer caches (90nm/65nm/45nm/32nm)

Cited by 35 publications

References 18 publications

Clock gate on abort: Towards energy-efficient hardware Transactional Memory

Clock gate on abort: Towards energy-efficient hardware Transactional Memory

Value-dependence of SRAM leakage in deca-nanometer technologies

Influence of Stacked 3D Memory/Cache Architectures on GPUs

Contact Info

Product

Resources

About