2017
DOI: 10.1007/s10766-017-0541-y
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Dynamic Pinning of Parallelized Applications by Distributed Reinforcement Learning

Abstract: Abstract. This paper introduces a resource allocation framework specifically tailored for addressing the problem of dynamic placement (or pinning) of parallelized applications to processing units. Under the proposed setup each thread of the parallelized application constitutes an independent decision maker (or agent), which (based on its own prior performance measurements and its own prior CPU-affinities) decides on which processing unit to run next. Decisions are updated recursively for each thread by a resou… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
2
2
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(11 citation statements)
references
References 12 publications
0
11
0
Order By: Relevance
“…This work extends prior work of the authors [8] in two directions: (a) we introduce a new type of reinforcement-learning dynamics that admits faster adjustment towards better allocations, and (b) evaluation is performed over a real-world application, that is a parallelized implementation of the Ant-Colony Optimization metaheuristic.…”
Section: Introductionmentioning
confidence: 78%
See 2 more Smart Citations
“…This work extends prior work of the authors [8] in two directions: (a) we introduce a new type of reinforcement-learning dynamics that admits faster adjustment towards better allocations, and (b) evaluation is performed over a real-world application, that is a parallelized implementation of the Ant-Colony Optimization metaheuristic.…”
Section: Introductionmentioning
confidence: 78%
“…In comparison to [8], the difference lies in the reinforcement direction. As Equation (4) dictates, the strategy vector is only adjusted when a performance is higher than the running-average performanceū i , which provides a faster adjustment towards better assignments.…”
Section: Each Thread Is Assigned a Performance Index That Coincides Wmentioning
confidence: 87%
See 1 more Smart Citation
“…Recognizing this need for both learning-and distributedbased optimization, and contrary to the aforementioned references on pinning of parallelized applications, our earlier work [6], [7] proposed a scheduling scheme for optimally allocating threads of a parallelized application that combines both a learning-and a distributed-based optimization. It requires a minimum information exchange, where only measurements collected from each running thread are needed.…”
Section: Related Work and Contributionsmentioning
confidence: 99%
“…In our previous work [6], [7], we have proposed a reinforcement-learning-based distributed scheduling framework (PaRLSched), adapted to Uniform Memory Architectures (UMA). In this paper, our goal is to provide a generalized methodology that also extends to Non-Uniform Memory Architectures (NUMA).…”
Section: Introductionmentioning
confidence: 99%