In this article, we describe and analyze the computational performance of a parallel shallow water equation (SWE) solver for atmospheric simulation using radial basis function-finite difference (RBF-FD) methods. The inherent "meshless" nature of RBF-FD methods provides significant numerical benefits over standard pseudospectral and traditional FD methods, but there are many challenges in terms of their performance and parallel implementation, due to RBF-FDs use of relatively large halos and unstructured indexing. With the use of reverse Cuthill-McKee node ordering and tiled transposition of the state variable matrices and RBF-FD differentiation matrices, these challenges were overcome. The RBF-FD solver was implemented for the SWE on the rotating sphere using message passing interface plus OpenMP/OpenACC to demonstrate scalability and performance portability on the three currently dominant high performance computing (HPC) architectures, namely, Intel Xeon multicore, Intel Xeon Phi manycore, and NVIDIA graphics processing unit systems.