We present a graphics processing unit (GPU) parallelization of the computation of the price of exotic crosscurrency interest rate derivatives via a partial differential equation (PDE) approach. In particular, we focus on the GPU-based parallel pricing of long-dated foreign exchange (FX) interest rate hybrids, namely power reverse dual currency (PRDC) swaps with Bermudan cancelable features. We consider a three-factor pricing model with FX volatility skew, which results in a time-dependent parabolic PDE in three spatial dimensions. Finite difference methods on uniform grids are used for the spatial discretization of the PDE, and the alternating direction implicit (ADI) technique is employed for the time discretization. We then exploit the parallel architectural features of GPUs together with the Compute Unified Device Architecture framework to design and implement an efficient parallel algorithm for pricing PRDC swaps. Over each period of the tenor structure, we divide the pricing of a Bermudan cancelable PRDC swap into two independent pricing subproblems, each of which can efficiently be solved on a GPU via a parallelization of the ADI timestepping technique. Numerical results indicate that GPUs can provide significant increase in performance over CPUs when pricing PRDC swaps. An analysis of the impact of the FX skew on such derivatives is provided. PRDC swaps. The use of a local volatility model provides better modeling for the skewness of the FX rate and at the same time avoids introducing more stochastic factors into the model.The ever growing interest in cross-currency interest rate derivatives in general, and PRDC swaps in particular, has created a need for efficient pricing and hedging strategies for them. The popular choice for pricing PRDC swaps is Monte-Carlo (MC) simulation, but this approach has several major disadvantages, including slow convergence, and the limitation that the price is obtained at a single point only in the domain as opposed to the global character of the partial differential equation (PDE) approach. In addition, accurate hedging parameters, such as delta and gamma, are generally harder to compute via an MC approach than via a PDE approach. On the other hand, when pricing PRDC swaps by the PDE approach, each stochastic factor in the pricing model gives rise to a spatial variable in the PDE. Because of the 'curse of dimensionality' associated with highdimensional PDEs-not to mention additional complexity due to multiple cash flows and exotic features, such as Bermudan cancelability-the pricing of such derivatives via the PDE approach is not only mathematically challenging but also very computationally intensive.Over the last few years, the rapid evolution of graphics processing units (GPUs) into powerful, cost efficient, programmable computing architectures for general purpose computations has provided application potential beyond the primary purpose of graphics processing. In the area of computational finance, although there has been great interest in utilizing GPUs in developing eff...