2022
DOI: 10.1109/tpds.2021.3105994

Design and Performance Characterization of RADICAL-Pilot on Leadership-Class Platforms

Citations: cited by 28 publications (23 citation statements)
References: 35 publications
“…High-throughput docking was implemented using RADICAL-Pilot (RP) and RAPTOR [21]. RP is a pilot-enabled runtime system, while RAPTOR is a scalable master/worker overlay developed to improve the execution performance of many short-running tasks encoded as Python functions. The runs used between 128 and 7000 concurrent nodes.…”
Section: Methods (mentioning)
confidence: 99%
“…In the past few months, we have progressed substantially with the implementation of our workflow. RCT is now fully functional on Summit [55, 73, 74] as well as Theta [75], in addition to several other HPC resources. It has successfully executed workloads at 95% usage on these machines.…”
Section: Workflow Management (mentioning)
confidence: 99%
“…It has successfully executed workloads at 95% usage on these machines. We characterized scaling performance of various components of our workflow using up to 392,000 cores and 24,582 GPUs to execute 24,552 heterogeneous executable tasks and 126 × 10⁶ python function tasks [74]. Recently, we have been able to achieve a performance of 144 M h⁻¹ docking hits screening approximately 10¹¹ ligands using over 8000 compute nodes, which is better than the previous best by a factor of two [76].…”
Section: Workflow Management (mentioning)
confidence: 99%
“…We integrate RADICAL-Pilot (RP) [4], [5], a pilot system that enables the execution of multi-task, heterogeneous workloads on diverse HPC platforms, and Parsl [6], a parallel scripting library for Python that provides workflow and dataflow capabilities. The result is a loosely-coupled system that does not require a dedicated code repository but only an interface with a small code footprint.…”
Section: Introduction (mentioning)
confidence: 99%