2013
DOI: 10.1007/s11227-013-1065-x
|View full text |Cite
|
Sign up to set email alerts
|

Performance analysis of SSE and AVX instructions in multi-core CPUs and GPU computing on FDTD scheme for solid and fluid vibration problems

Abstract: In this work a unified treatment of solid and fluid vibration problems is 9 developed by means of the Finite-Di↵erence Time-Domain (FDTD). The scheme 10 here proposed takes advantage from a scaling factor in the velocity fields that im-11 proves the performance of the method and the vibration analysis in heterogenous 12 media. Moreover, the scheme has been extended in order to simulate both the The Journal of Supercomputing. 2014Supercomputing. , 70(2): 514-526. doi:10.1007Supercomputing. /s11227-013-1065

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 18 publications
0
6
0
Order By: Relevance
“…It is worth to note that the scaling factor and the relationship between spatial and time resolutions have been carefully chosen in order to avoid rounding and finite precision errors [15]. The usage of double precision was experimentally proven not to improve significantly the accuracy of the results obtained but it implied a dramatic downside effect in terms of computational resources.…”
Section: Multi-cpu Approach Of the Fdtd Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…It is worth to note that the scaling factor and the relationship between spatial and time resolutions have been carefully chosen in order to avoid rounding and finite precision errors [15]. The usage of double precision was experimentally proven not to improve significantly the accuracy of the results obtained but it implied a dramatic downside effect in terms of computational resources.…”
Section: Multi-cpu Approach Of the Fdtd Methodsmentioning
confidence: 99%
“…Regarding this aspect, some works related with GPU computing and FDTD in the field of Electromagnetics have been developed [11][12][13]. For FDTD and GPU computing applied to vibration problems there are some contributions related with seismology [14] and also for vibroacoustics [15]. The application of multi-GPU has been applied to FDTD and Electromagnetics in [16,17] but an accurate performance analysis of multi-CPU FDTD code that uses SSE and AVX instructions compared to a multi-GPU version with Peer-to-Peer communication has not been carried out to the best of our knowledge.…”
Section: Introductionmentioning
confidence: 99%
“…In order to achieve this reduction parallel strategies have been considered. 11,20,21 Basically, the auto-vectorisation performed by the compiler and OpenMP have been considered. For enabling properly the auto-vectorisation an efficient memory alignment, a correct loop count and proper structures were considered.…”
Section: One-dimensional Nonlinear Coated Binary Gratingmentioning
confidence: 99%
“…Vectorization is the process by which the implementation of an algorithm is converted from scalar to vectorial such that one single operation is executed over a group of contiguous values, all at the same time. In our particular case, the vectorization only applies to large floating point operations (inner loop) [23]. Thus, when the loops are collapsed, the granularity is reduced and the vectorization could not be applied; that is, by collapsing the loops, computing is insufficient for vectorization.…”
Section: Experiments On Cpu the Configuration Of The Worktation Ismentioning
confidence: 99%