Proceedings of the 37th Annual International Symposium on Computer Architecture 2010
DOI: 10.1145/1815961.1815996
|View full text |Cite
|
Sign up to set email alerts
|

Thread tailor

Abstract: Extracting performance from modern parallel architectures requires that applications be divided into many different threads of execution. Unfortunately selecting the appropriate number of threads for an application is a daunting task. Having too many threads can quickly saturate shared resources, such as cache capacity or memory bandwidth, thus degrading performance. On the other hand, having too few threads makes inefficient use of the resources available. Beyond static resource assignment, the program inputs… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 63 publications
(6 citation statements)
references
References 36 publications
0
6
0
Order By: Relevance
“…For example, if there are eight tasks and four cores, then each core will perform two tasks, and the total time will be roughly the amount of time needed to perform two tasks on a single core. In contrast, nine tasks would take much more time, because the last would execute alone [87].…”
Section: Load Balancingmentioning
confidence: 99%
“…For example, if there are eight tasks and four cores, then each core will perform two tasks, and the total time will be roughly the amount of time needed to perform two tasks on a single core. In contrast, nine tasks would take much more time, because the last would execute alone [87].…”
Section: Load Balancingmentioning
confidence: 99%
“…Speculatively executing multiple threads add significant overhead in the former. Few solutions aim to minimize resource utilization using work-stealing [5,25] and other proposals target to avoid oversubscription in [7,20]. These solutions require extensive offline profiling and are not effective on new unseen programs.…”
Section: Related Workmentioning
confidence: 99%
“…There has, in fact, been significant work from the compiler and runtime communities in improving workloadaware thread selection. Schemes that try to combine offline models with runtime tuning [11,14,16,20] can exploit prior knowledge of the program but are limited by the assumptions of the environment.…”
Section: Introductionmentioning
confidence: 99%
“…A work stealing application can also shutdown existing threads while preserving program correctness, so long as one thread is left executing. Additionally, automatic methods exist to combine program threads into a single instruction stream [4].…”
Section: Offline Model Construction Dynamic Configuration Of Multithrmentioning
confidence: 99%