2022
DOI: 10.1016/j.parco.2022.102893

OpenACC + Athread collaborative optimization of Silicon-Crystal application on Sunway TaihuLight

Cited by 3 publications
(4 citation statements)
References 9 publications
“…In [49], the authors introduced a novel sequence alignment technique called ESA. This algorithm is implemented on the Sunway TaihuLight architecture and is capable of performing both local and global alignment.…”
Section: Literature Review (mentioning)
confidence: 99%
“…To verify the effectiveness and universality of this scheme, a core group of the SW26010p multi-core processor is used as the test platform for this experiment. The computational tasks are loaded asynchronously to the slave core for execution with the help of the high-performance threading library Athread [25]. The following schemes will be tested in this experiment: Validity test of the bracketing memory allocation algorithm; implementation of serial SpMV algorithm based on main kernel; implementation of SpMV algorithm based on master-slave acceleration; implementation of x optimization algorithm based on slave architecture LRU and LRU-K and x access optimization algorithm based on slave architecture ARC.…”
Section: Experimental Environment and Experimental Scheme (mentioning)
confidence: 99%
“…[9,10,18] Our work was mainly carried out on this basis, and we completed the parallel acceleration of OpenACC based bulk silicon MD simulation program on Sunway TaihuLight [19], and then further improved its performance by using OpenACC+Athread [20]. Ami Marowka points out that the 3P challenges of high-performance programming (performance, portability, and productivity) have become more difficult than ever in the era of heterogeneous computing [21]. Directives strive to offer portability without losing performance and are one of the most portable and productive programming models.…”
Section: Related Surveys and Our Contributions (mentioning)
confidence: 99%
“…To give full play to the advantages of multi‐scale parallel computing mode, Hou et al. independently developed an efficient and highly scalable MD simulation program for crystalline silicon, and carried out large‐scale heterogeneous parallel computing tests on Mole8.5 and Tianhe‐1A [9,10,18]. Our work was mainly carried out on this basis, and we completed the parallel acceleration of OpenACC based bulk silicon MD simulation program on Sunway TaihuLight [19], and then further improved its performance by using OpenACC+Athread [20]…”
Section: Introduction (mentioning)
confidence: 99%