2015
DOI: 10.1155/2015/316012

A Performance Study of a Dual Xeon-Phi Cluster for the Forward Modelling of Gravitational Fields

Abstract: With at least 60 processing cores, the Xeon-Phi coprocessor is a truly multicore architecture, featuring an interconnection speed among cores of 240 GB/s, two levels of cache memory, a theoretical peak performance of 1.01 Tflops, and programming flexibility, all of which make the Xeon-Phi an excellent coprocessor for parallelizing applications that seek to reduce computational times. The objective of this work is to migrate a geophysical application designed to directly calculate the gravimetric tensor components …

Cited by 4 publications (6 citation statements)
References 18 publications
“…With respect to the computing architecture, it is well known that a deep acquaintance with the architecture leads to a very efficient parallel solution to the problem, which in turn becomes complicated because it requires low-level programming skills and a long development (coding) time. This is the case when using CUDA C for Graphics Processing Units (GPUs) or the Message Passing Interface (MPI) for distributed-memory architectures such as clusters or Xeon Phi coprocessors [48,52,53]. As one of the goals of this paper is to make this research accessible to the greatest possible part of the geophysical and related communities, the acceleration of the algorithms presented herein is based on shared-memory CPU-based architectures.…”
Section: Parallel Implementation of Forward Modelling
confidence: 99%
“…Thus, a deficient design of the parallel strategy is prone to yield poor performance. OpenMP has benefited several geophysical problems [48,52]. OpenMP also provides an implicit parallelism model that produces medium-granularity tasks for MT applications, offering a higher level of computational abstraction than MPI.…”
Section: Parallel Implementation of Forward Modelling
confidence: 99%
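The excerpt above contrasts OpenMP's implicit, loop-level parallelism with explicit message passing. The following sketch is not taken from the cited papers; it only illustrates, under assumed names (forward_model, point_mass_gz, obs, src, gz), how a single OpenMP pragma can distribute a forward-modelling loop over independent observation points among CPU threads.

```c
/* Minimal sketch, assuming independent observation points and a simplified
 * point-mass kernel standing in for a prism kernel; all names are hypothetical. */
#include <math.h>
#include <omp.h>

#define G 6.674e-11  /* gravitational constant, m^3 kg^-1 s^-2 */

/* Vertical attraction of a point mass at src (x, y, z, mass) on an observer. */
static double point_mass_gz(const double obs[3], const double src[4])
{
    double dx = src[0] - obs[0], dy = src[1] - obs[1], dz = src[2] - obs[2];
    double r  = sqrt(dx * dx + dy * dy + dz * dz);
    return G * src[3] * dz / (r * r * r);
}

void forward_model(int n_obs, int n_src,
                   const double obs[][3], const double src[][4], double gz[])
{
    /* Observation points are independent, so one parallel-for yields
     * medium-grained tasks with no explicit decomposition or messaging. */
    #pragma omp parallel for schedule(static)
    for (int i = 0; i < n_obs; i++) {
        double sum = 0.0;
        for (int j = 0; j < n_src; j++)
            sum += point_mass_gz(obs[i], src[j]);
        gz[i] = sum;
    }
}
```

Compiled with an OpenMP flag (e.g. -fopenmp), the same source runs serially or in parallel, which is the higher level of abstraction the excerpt refers to.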
“…An advantage of using the presented CPML ABC is a drastic reduction in the number of memory arrays in the two-dimensional algorithm [4], so it can easily be implemented for GPU processing, as those cards possess a limited amount of memory. The CPML ABC for the FDTD method requires storing the values of the time derivatives in memory variables, which is implemented in two ways in this paper: first, by allocating the memory variables over the whole domain, and second, by allocating them only in the absorption region (see Figure 2) [19,20].…”
Section: International Journal of Antennas and Propagation
confidence: 99%
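As a rough illustration of the memory-footprint argument in the excerpt above, the sketch below (hypothetical names alloc_cpml_full and alloc_cpml_strip, not the authors' code) contrasts allocating a CPML memory variable over the whole 2-D grid with allocating it only inside an absorbing strip of thickness npml cells.

```c
/* A minimal sketch, assuming a 2-D nx-by-ny grid and an absorbing layer of
 * npml cells along one boundary; names and layout are hypothetical. */
#include <stdlib.h>

/* Strategy 1: memory variable stored at every grid cell (simplest indexing,
 * largest footprint -- a concern on memory-limited GPUs). */
double *alloc_cpml_full(int nx, int ny)
{
    return calloc((size_t)nx * ny, sizeof(double));
}

/* Strategy 2: memory variable stored only inside one absorbing strip of
 * thickness npml, shrinking that array from nx*ny to npml*ny values. */
double *alloc_cpml_strip(int ny, int npml)
{
    return calloc((size_t)npml * ny, sizeof(double));
}
```

In a full implementation one such strip would be kept per absorbing boundary, and the FDTD update would index into it only for cells lying inside the absorption region.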
“…To divide tasks in a balanced way, we follow the procedure described by Couder-Castañeda et al. [17] and Arroyo et al. [18]. Let p_n be the number of MPI processes and C_n the number of problems to solve; then we define the problem numbers with which a process p must start and finish as p_s and p_e, respectively.…”
Section: MPI Distributed Implementation
confidence: 99%
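The excerpt describes each MPI process deriving its own start and end problem indices p_s and p_e from the number of processes p_n and the number of problems C_n. The sketch below shows one common balanced block distribution (the exact formula used in [17,18] may differ); the value of C_n is arbitrary and chosen only for illustration.

```c
/* Sketch of a balanced block distribution of C_n problems over p_n MPI ranks;
 * the first (C_n % p_n) ranks take one extra problem, so loads differ by at
 * most one. Not the formula from the cited works, only a common variant. */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv)
{
    int rank, p_n;
    const int C_n = 10;                          /* hypothetical problem count */

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &p_n);

    int base = C_n / p_n, rem = C_n % p_n;
    int p_s  = rank * base + (rank < rem ? rank : rem);   /* first problem */
    int p_e  = p_s + base + (rank < rem ? 1 : 0) - 1;     /* last problem  */

    printf("rank %d solves problems %d..%d\n", rank, p_s, p_e);

    MPI_Finalize();
    return 0;
}
```

For example, with C_n = 10 and p_n = 4, ranks 0 and 1 receive problems 0-2 and 3-5 (three each), while ranks 2 and 3 receive 6-7 and 8-9 (two each).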