“…Fully optimized implementation for the FDTD(2,2), FDTD(2,4), and WE-FDTD(2,2) schemes gain 19.1, 20.3, and 9.3 times speedup against the initial implementation on the Intel MIC, respectively. The performance that was achieved for the FDTD(2,2), FDTD (2,4), and WE-FDTD(2,2) schemes is 45.7, 77.9, and 113 GFlops on the Intel MIC, respectively. Similarly, on the CPU, the gain in speedup was 1.7, 1.6, and 1.7 times, achieving 6.8, 10.1, and 18.4 GFlops, respectively.…”