Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques 2006
DOI: 10.1145/1152154.1152160

Architectural support for operating system-driven CMP cache management

Abstract: The role of the operating system (OS) in managing shared resources such as CPU time, memory, peripherals, and even energy is well motivated and understood [23]. Unfortunately, one key resource, the lower-level shared cache in chip multiprocessors, is commonly managed purely in hardware by rudimentary replacement policies such as least-recently-used (LRU). The rigid nature of the hardware cache management policy poses a serious problem, since there is no single best cache management policy across all sharing scenarios…
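To make the contrast with pure hardware LRU concrete, here is a minimal sketch (purely illustrative, not the paper's actual mechanism) of how an OS-specified way partition can constrain an otherwise LRU-managed set in a shared cache. The `PartitionedSet` class and its quota interface are hypothetical names.

```python
from collections import OrderedDict

class PartitionedSet:
    """One set of a set-associative shared cache with OS-assigned way quotas."""

    def __init__(self, quotas):
        # quotas: hypothetical OS-supplied map, resource principal -> ways
        self.quotas = quotas
        # per-principal LRU order of resident tags (oldest first)
        self.lines = {p: OrderedDict() for p in quotas}

    def access(self, principal, tag):
        lines = self.lines[principal]
        if tag in lines:                  # hit: refresh LRU position
            lines.move_to_end(tag)
            return "hit"
        if len(lines) >= self.quotas[principal]:
            lines.popitem(last=False)     # evict this principal's own LRU line
        lines[tag] = True                 # fill the new line
        return "miss"

# Example: the OS grants principal A three of four ways and B one way,
# so A's working set cannot evict B's single resident line.
s = PartitionedSet({"A": 3, "B": 1})
print([s.access("A", t) for t in ["x", "y", "z", "x"]])  # miss, miss, miss, hit
print(s.access("B", "q"))                                # miss
```

Under plain LRU, A's three fills plus B's fill would compete for all four ways; the quota keeps each principal's evictions within its own allocation.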

Cited by 180 publications (132 citation statements)
References 23 publications
“…Our second innovation is a feedback-based adaptive bandwidth sharing policy in which we periodically tune the bandwidth assigned to the sharers in order to achieve specified DRAM latencies. Adaptive bandwidth management does not add to the complexity of the hardware because it can be done entirely in software at the operating system (OS) or the hypervisor by using interfaces and mechanisms similar to those proposed for shared cache management [29]. While that interface supports various features such as thread migration and thread grouping (wherein a group of threads acts as a single resource principal), our study assumes that each thread running on a processor is a unique resource principal with its own allocated bandwidth share.…”
Section: Introduction
confidence: 99%
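The feedback loop this excerpt describes can be sketched as follows; the step size, renormalization, and latency inputs are illustrative assumptions, not the cited paper's actual controller.

```python
# Hypothetical sketch of feedback-based adaptive bandwidth sharing: each
# epoch, compare every sharer's observed DRAM latency with its target and
# nudge its bandwidth share accordingly. Names and constants are assumed.

def retune_shares(shares, observed_lat, target_lat, step=0.05):
    """shares: thread id -> fraction of DRAM bandwidth (sums to 1.0)."""
    for tid in shares:
        if observed_lat[tid] > target_lat[tid]:
            shares[tid] += step      # missing its latency target: give more
        else:
            shares[tid] = max(step, shares[tid] - step)  # has slack: reclaim
    total = sum(shares.values())
    for tid in shares:               # renormalize so shares still sum to 1.0
        shares[tid] /= total
    return shares

# Example epoch: thread 0 misses its target, thread 1 has slack.
shares = {0: 0.5, 1: 0.5}
print(retune_shares(shares,
                    observed_lat={0: 220, 1: 90},
                    target_lat={0: 180, 1: 120}))
```

Because the tuning runs periodically in the OS or hypervisor, as the excerpt notes, the hardware only needs to expose per-thread share registers and latency counters, not the control policy itself.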
“…Figure 7 shows the throughput of selected benchmark pairs with uncontrolled Pseudo-LRU and the two cache partitioning schemes. We use fair speedup, defined in Equation 14, to measure the throughput of concurrently running benchmark pairs. Note that a taller bar in the figure means higher throughput.…”
Section: Evaluation of Fairness Metrics
confidence: 99%
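Equation 14 is not reproduced in the excerpt; one common definition of fair speedup, the harmonic mean of per-benchmark speedups over their solo runs, is sketched below under that assumption.

```python
# Assumed definition (Equation 14 itself is not shown in the excerpt):
# for N co-running benchmarks,
#   FS = N / sum_i (IPC_alone_i / IPC_shared_i)
# i.e., the harmonic mean of per-benchmark speedups relative to running alone.

def fair_speedup(ipc_shared, ipc_alone):
    n = len(ipc_shared)
    return n / sum(a / s for a, s in zip(ipc_alone, ipc_shared))

# Example pair: both benchmarks retain most of their solo IPC.
print(fair_speedup(ipc_shared=[0.8, 1.5], ipc_alone=[1.0, 2.0]))
# 2 / (1.0/0.8 + 2.0/1.5) ≈ 0.77; values near 1.0 indicate little slowdown.
```

The harmonic mean rewards schemes that slow all co-runners evenly, which is why it is often preferred over raw aggregate throughput as a fairness-aware metric.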
“…[4] proposed a framework to provide QoS for resources including shared caches. [14] designed architectural support for the OS to manage shared caches.…”
Section: Related Work
confidence: 99%
“…However, most of these studies have focused on a single component of the entire system. For example, techniques have been proposed to reduce cache capacity interference [1, 3-7], cache bandwidth interference [13] and memory bus transfer interference [2, 8, 9]. Unfortunately, a technique that reduces interference in one component is not adequate to provide interference control for the complete memory system.…”
Section: Related Work
confidence: 99%
“…Since these effects are clearly undesirable, there is a need for architectural techniques that provide predictable performance and improve fairness. Previously, cache capacity interference has received a great deal of attention [1, 3-7] while only a few researchers have proposed techniques that reduce memory bus interference [2, 8, 9]. Furthermore, there has been little interest in the details of designing a complete, thread-aware memory system [10-12].…”
Section: Introduction
confidence: 99%