2011
DOI: 10.1002/cpe.1859
|View full text |Cite
|
Sign up to set email alerts
|

Performance study on CUDA GPUs for parallelizing the local ensemble transformed Kalman filter algorithm

Abstract: SUMMARYModern graphics cards provide computational capabilities that exceed current CPUs. As one of the computational intensive problems, numerical weather prediction has the opportunity to benefit from the massive number of threads and large memory throughput in the graphics architecture. In this paper, we present the key steps to integrate the Compute Unified Device Architecture (CUDA) programming framework for one key component in numerical weather prediction, the data assimilation algorithm, which incorpor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 7 publications
0
5
0
Order By: Relevance
“…GKF-UI algorithm begins with reading the initial conditions at time k = 0 and partitioning H k and G k into submatrices as shown in Equations (8) and (9). The initial value of the submatrix H k−1 is computed using the Kalman gain (Block 3) given in Equation (10). The error covariance matrices In addition, it is worth mentioning that the major applications of the GKF-UI 12 is for the cases that not all the external excitation data is available or cannot be measured; therefore, traditional Kalman filter (KF) algorithms 3 are not utilizable.…”
Section: Of 17mentioning
confidence: 99%
See 1 more Smart Citation
“…GKF-UI algorithm begins with reading the initial conditions at time k = 0 and partitioning H k and G k into submatrices as shown in Equations (8) and (9). The initial value of the submatrix H k−1 is computed using the Kalman gain (Block 3) given in Equation (10). The error covariance matrices In addition, it is worth mentioning that the major applications of the GKF-UI 12 is for the cases that not all the external excitation data is available or cannot be measured; therefore, traditional Kalman filter (KF) algorithms 3 are not utilizable.…”
Section: Of 17mentioning
confidence: 99%
“…9 4. Popular approaches 5,10 usually parallelize the discrete-time Kalman filter algorithm only by parallelizing matrix operations, and they are unable to achieve optimal performance due to the partial parallelization of the algorithm. However, there are few other techniques in the literature that almost completely parallelize the Kalman filter algorithm, 4,11 but only for special cases.…”
Section: Introductionmentioning
confidence: 99%
“…Other data-assimilation methods have also been subject to GPU-accelerations. Blattner and Yang [13] give a performance study of a GPU-implementation of the local ensemble transform Kalman filter, Wei and Huang [14] explore a GPU-based implementation of the EKF, and Quinn and Abarbanel [15] present a general path integral Monte Carlo approach applied to a neuron model. They all report massive speed-ups on the order 100-1000 over CPU implementations.…”
Section: Related Workmentioning
confidence: 99%
“…The speedup reached 1386x and the parallel implementation of extended Kalman filter will serve as good reference on real large-scale applications. Blattner [64] utilized the CUDA programming framework in numerical weather prediction by the data assimilation method, local ensemble transformed Kalman filter algorithm. Results show that an improvement of 72.1 × speedup and provide attractive evidence for applying CUDA GPUs to high demanding scientific computation realms.…”
Section: Introductionmentioning
confidence: 99%