IEEE International Symposium on High-Performance Comp Architecture 2012
DOI: 10.1109/hpca.2012.6168952
|View full text |Cite
|
Sign up to set email alerts
|

BulkSMT: Designing SMT processors for atomic-block execution

Abstract: Multiprocessor architectures that continuously execute atomic blocks (or chunks) of instructions can improve performance and software productivity. However, all of the prior proposals for such architectures assume single-context cores as building blocksrather than the widely-used Simultaneous Multithreading (SMT) cores. As a result, they are underutilizing hardware resources.This paper presents the first SMT design that supports continuous chunked (or transactional) execution of its contexts. Our design, calle… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 14 publications
(11 citation statements)
references
References 27 publications
0
11
0
Order By: Relevance
“…Intra-Warp Conflict Resolution. Qian et al [29] described a method to detect and resolve conflicts among threads running on SMT CPU cores, which usually have far fewer threads per core in comparison to GPU cores. The relatively small number of threads allows their design to dedicate explicit storage to record the dependency between transactions and extend each cache line to record read-sharer information.…”
Section: Sensitivity To Core Scalingmentioning
confidence: 99%
“…Intra-Warp Conflict Resolution. Qian et al [29] described a method to detect and resolve conflicts among threads running on SMT CPU cores, which usually have far fewer threads per core in comparison to GPU cores. The relatively small number of threads allows their design to dedicate explicit storage to record the dependency between transactions and extend each cache line to record read-sharer information.…”
Section: Sensitivity To Core Scalingmentioning
confidence: 99%
“…Currently, there are several proposals that support conflict serialization of atomic blocks, most notably DATM [20], SONTM [4], BulkSMT [18], and Waitn-GoTM [16]. However, they all fall short of the goal of this paper.…”
Section: Introductionmentioning
confidence: 92%
“…There are a few proposals that support conflict serialization of atomic blocks, such as DATM [20], SONTM [4], BulkSMT [18], and Wait-n-GoTM [16]. While each of these schemes advances the state-of-the-art in some directions, none provides an effective approach usable in a distributed directorybased protocol for both software-demarcated atomic blocks (e.g., in TM) and hardware-initiated blocks (e.g., in speculation for strict memory consistency).…”
Section: Existing Proposalsmentioning
confidence: 99%
See 1 more Smart Citation
“…The SVC design requires conditional gang invalidation, an uncommon architectural feature which requires circuit-level changes (additional transistors) to the SRAM cell. In a similar vein, BulkSMT [Qian et al 2012] describes a design for supporting simultaneous speculative contexts on a multithreaded core using cache line annotations requiring conditional gang clearing logic based on comparison between multi-bit fields. The Memory Disambiguation Table (MDT) design [Krishnan and Torrellas 1999] requires caches that support both a speculative write-back mode and a nonspeculative write-through mode.…”
Section: Related Workmentioning
confidence: 99%