“…Strong scaling: In Figure 2 (#1, #2, #3, #4, #5, #6, #7, #8, #9, #10, #11, and #12), we use a 2 24 -by-2 24 Gaussian kernel matrix generated with a synthetic 6-D point dataset. Using this matrix, we perform strong scaling experiments using up to 6,144 Skylake cores (128 compute nodes, using one MPI process per node and 48 OpenMP threads).…”