2018
DOI: 10.1007/978-3-319-92040-5_11
|View full text |Cite
|
Sign up to set email alerts
|

Performance Optimization and Evaluation of Scalable Optoelectronics Application on Large Scale KNL Cluster

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 16 publications
0
2
0
Order By: Relevance
“…We next consider the hand-coding vectorization using ACLE. In Hirokawa et al (2018), we discussed optimization of the same calculation using Intel AVX-512 intrinsics. In SVE (Scalable Vector Extension) instructions that A64FX supports, it is possible to set the hardware SIMD length between 128-bit and 2048- bit.…”
Section: Simulation Methods and Implementationsmentioning
confidence: 99%
See 1 more Smart Citation
“…We next consider the hand-coding vectorization using ACLE. In Hirokawa et al (2018), we discussed optimization of the same calculation using Intel AVX-512 intrinsics. In SVE (Scalable Vector Extension) instructions that A64FX supports, it is possible to set the hardware SIMD length between 128-bit and 2048- bit.…”
Section: Simulation Methods and Implementationsmentioning
confidence: 99%
“…A typical single production run required 19 hours using 8000 nodes (64,000 cores). In Hirokawa et al (2018), coupled calculation of 3D Maxwell and 3D TDKS using ARTED was reported. The calculation was performed using almost all 8192 nodes of the KNL processors of Oakforest-PACS at JCAHPC, and a 12–16% performance was obtained.…”
Section: Introductionmentioning
confidence: 99%