2016
DOI: 10.1109/tc.2015.2417533
|View full text |Cite
|
Sign up to set email alerts
|

Thread Assignment in Multicore/Multithreaded Processors: A Statistical Approach

Abstract: Abstract-The introduction of multicore/multithreaded processors, comprised of a large number of hardware contexts (virtual CPUs) that share resources at multiple levels, has made process scheduling, in particular assignment of running threads to available hardware contexts, an important aspect of system performance. Nevertheless, thread assignment of applications running on state-of-the art processors is an NP-complete problem. Over the years, numerous studies have proposed heuristic-based algorithms for thr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(13 citation statements)
references
References 46 publications
0
12
0
Order By: Relevance
“…Each of the previous works is able to achieve the best performance for some of the benchmarks, but none of them explores enough of the search space to achieve the best for all, as we do. (PPP=Page Placement Policy, PND=Page NUMA Degree, TPP=Thread Placement Policy, NT=Number of Threads, TND=Thread NUMA Degree) Previous work PPP: [6,10,23,24,26], TPP/NT/TND: [12,17,27,29,30,34], PPP/TPP: [11]. Our work includes all optimizations and performs significantly better.…”
Section: Codelet Search Speedmentioning
confidence: 96%
See 1 more Smart Citation
“…Each of the previous works is able to achieve the best performance for some of the benchmarks, but none of them explores enough of the search space to achieve the best for all, as we do. (PPP=Page Placement Policy, PND=Page NUMA Degree, TPP=Thread Placement Policy, NT=Number of Threads, TND=Thread NUMA Degree) Previous work PPP: [6,10,23,24,26], TPP/NT/TND: [12,17,27,29,30,34], PPP/TPP: [11]. Our work includes all optimizations and performs significantly better.…”
Section: Codelet Search Speedmentioning
confidence: 96%
“…For cg and needle, one region in each (needle_176 and cg_405) can obtain the optimal performance with PPP/PND, but their other regions (needle_116 and cg_551) require the full search space to achieve the best results. Overall, our full search is able to achieve an average speedup of 2.48× (median 2.23×) compared to 1.84× (median 1.27×) for the best subset of existing TPP/NT/TND searches [12,17,27,29,30,34]. Figure 8 (top) shows the predicted speedups from optimizing the first instance of each region in the application via the codelet vs. the actual performance achieved by all instances of that region in the application with the optimization.…”
Section: Region Optimizationmentioning
confidence: 99%
“…An approach based on EVT theory has also been used for optimal performance analysis. Radojković et al [5] presented a new method for predicting the performance of the thread assignment in multi-core processors. Using statistical inference of each thread assignment in a random sample, the authors estimates the optimal one.…”
Section: Related Workmentioning
confidence: 99%
“…In the related work, a number of methods for application of EVT theory for WCET estimations has been proposed [2], [5], [6]; nevertheless, a considerable uncertainty due to the complexity of the problem still exists. Results of statistical tests are often fuzzy and it is hard to make a correct decision on their basis to fulfill EVT requirements.…”
Section: Introductionmentioning
confidence: 99%
“…when BlackBox makes scheduling decision, it considers two tables to make performance prediction and picks out the best solution. In new research [7], Radojkovicp presents the best scheduling scheme by combining Extreme Value theory and performance evaluation of historical scheduling result.…”
Section: A Historical Data Learningmentioning
confidence: 99%