2014
DOI: 10.1109/map.2014.6837098

Porting an Explicit Time-Domain Volume-Integral-Equation Solver on GPUs with OpenACC [Open Problems in CEM]

Abstract: Graphics processing units (GPUs) are gradually becoming mainstream in high-performance computing, as their ability to improve the performance of a wide range of scientific applications many-fold over multi-core CPUs has been clearly identified and proven. In this paper, implementation and performance-tuning details for porting an explicit marching-on-in-time (MOT)-based time-domain volume-integral-equation (TDVIE) solver onto GPUs are described. To this end, a high-level app…

Cited by 4 publications (2 citation statements)
References 14 publications
“…Moreover, CPU parallelized [23] and GPU accelerated [24] implementations are developed to further advance the capability of the solver. That said, the applicability of this solver to the real-life scattering problems is still limited by its high computational complexity, i.e.,…”
Section: > Replace This Line With Your Paper Identification Number (D
Citation type: mentioning
confidence: 99%
“…Approaches such as OpenMP 4.0 [52], OpenACC [53], OCCA [54], and HMPP [55], which use a single portable code base with annotations, still do not reach the performance obtained by coding directly against the API best suited to each device [56,57,58,59]. In other words, obtaining the highest performance on a given architecture requires using the API that best fits that architecture.…”
Section: Programming Heterogeneous Architectures
Citation type: unclassified