Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis 2019
DOI: 10.1145/3295500.3356200
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing the data movement in quantum transport simulations via data-centric parallel programming

Abstract: Designing efficient cooling systems for integrated circuits (ICs) relies on a deep understanding of the electro-thermal properties of transistors. To shed light on this issue in currently fabricated Fin-FETs, a quantum mechanical solver capable of revealing atomicallyresolved electron and phonon transport phenomena from firstprinciples is required. In this paper, we consider a global, datacentric view of a state-of-the-art quantum transport simulator to optimize its execution on supercomputers. The approach yi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
2
1
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 20 publications
0
3
0
Order By: Relevance
“…A critical view on using NEGF for nanoelectronic devices is given in [48]. Some representative examples of NEGF simulation tools are, e.g., Omen [49], NEMO5 [50], NESS [51], Atomistix [52], NanoTCAD ViDES [53], and Victory Atomistic [54].…”
Section: Nonequilibrium Green's Functions (Keldysh Formalism)mentioning
confidence: 99%
“…A critical view on using NEGF for nanoelectronic devices is given in [48]. Some representative examples of NEGF simulation tools are, e.g., Omen [49], NEMO5 [50], NESS [51], Atomistix [52], NanoTCAD ViDES [53], and Victory Atomistic [54].…”
Section: Nonequilibrium Green's Functions (Keldysh Formalism)mentioning
confidence: 99%
“…Using fission and data layout transformations, we reshape the job into a stencillike strided-batched GEMM operation, where the DaCe implementation yields up to 4.8× speedup over cuBLAS. The full implementation details and transformations are described by Ziogas et al [30]. Below, we highlight the innovations that led to the most significant performance improvements.…”
Section: Innovations Realizedmentioning
confidence: 99%
“…• Load Balancing: Similar to OMEN, we divide work among electrons and phonons unevenly, so as to reduce imbalance. [30]. Below, we highlight the innovations that led to the most significant performance improvements.…”
Section: Innovations Realizedmentioning
confidence: 99%