CUDA Fortran for Scientists and Engineers 2014
DOI: 10.1016/b978-0-12-416970-8.00003-1
|View full text |Cite
|
Sign up to set email alerts
|

Optimization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 41 publications
0
3
0
Order By: Relevance
“…Our implementation with CUDA Fortran [45] offers unprecedented speed and efficiency already visible on commodity hardware (e.g., GeForce 1080). Furthermore, it can be easily tuned for professional GPUs such as Titan V [37] virtually at no extra effort.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Our implementation with CUDA Fortran [45] offers unprecedented speed and efficiency already visible on commodity hardware (e.g., GeForce 1080). Furthermore, it can be easily tuned for professional GPUs such as Titan V [37] virtually at no extra effort.…”
Section: Discussionmentioning
confidence: 99%
“…The core components of our implementation has been written in modern Fortran 95/2003 [41], which we have chosen for its flexibility [42], extensive support for linear algebra [43], performance [44] and native support for CUDA technology [45]. To make our code easier to use, we have wrapped it in a Python package using the f2py [46] utility and numpy's fork of distutils package [47].…”
Section: Languages and Technologies Employedmentioning
confidence: 99%
“…Since the sustainable bandwidth from the host to the device (and vice versa) plays a key role in the acceleration of a single DGEMM or DTRSM calls, in the CUDA‐Aware HPL benchmark, the CUDA tool related to a fast transfer mode is exploited. Such a tool is enabled when page‐locked memory (sometimes called pinned memory 29 ) is used.…”
Section: Related Work and Motivationmentioning
confidence: 99%