2013
DOI: 10.1007/s00607-013-0329-x
|View full text |Cite
|
Sign up to set email alerts
|

An integrated fine-grain runtime system for MPI

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 14 publications
0
7
0
Order By: Relevance
“…All of the remaining processes are configured to be free processes. These free processes are all blocked on a receive call and FG-MPI's runtime scheduler [11] ensures that they remain on a blocked queue and do not add any overhead while blocked. Skip list processes make free node requests to the co-located manager process which cooperates with the other managers to find a free process.…”
Section: Fine-grain Mpimentioning
confidence: 99%
See 1 more Smart Citation
“…All of the remaining processes are configured to be free processes. These free processes are all blocked on a receive call and FG-MPI's runtime scheduler [11] ensures that they remain on a blocked queue and do not add any overhead while blocked. Skip list processes make free node requests to the co-located manager process which cooperates with the other managers to find a free process.…”
Section: Fine-grain Mpimentioning
confidence: 99%
“…A crucial element of our design is the use of Fine-Grain MPI [11] (FG-MPI). FG-MPI extends MPI and makes it possible to express and exploit finer-grain, function-level concurrency and parallelism by allowing for multiple MPI processes inside an OS-process.…”
Section: Introductionmentioning
confidence: 99%
“…Kamal et al [18] make use of User Level Threads (ULT) in the MPICH 2 [12] to build an MPI-aware scheduler for coroutines that are swapped in and out for execution depending on the status of the MPI runtime. Lu et al [21] follow a similar approach by doing the context switch of ULTs inside the MPI to avoid the expensive MPI locking operations.…”
Section: Related Workmentioning
confidence: 99%
“…The increasing complexity of Multiprocessor System-on-Chip (MPSoC) drives the needs for system software development. To exploit the computation capability of MPSoC, fine-grained task models like Intel's TBB [1], Cilk++ [2], Fine-grain MPI(FG-MPI) [3] and Simulink [4] have been proposed to expose the computation parallelism, which provides more chances for system performance optimization, including easier load balancing, greater potential for overlapping communication and computation, and improved platformindependence [5].…”
Section: Introductionmentioning
confidence: 99%
“…However, it is unclear which tasks and how many cycles of each task to be preprocessed, which requires theoretical guidance for users. For the scheduling challenge, existing scheduling approaches on fine-grained models [1]- [3], [5] mainly focus on runtime implementation, but design-time (i.e. static) approaches are also important.…”
Section: Introductionmentioning
confidence: 99%