Fluorescence correlation spectroscopy (FCS) is a valuable tool to study the molecular dynamics of living cells. When used together with a super-resolution stimulated emission depletion (STED) microscope, STED-FCS can measure diffusion processes at the nanoscale in living cells. In twodimensional (2D) systems like the cellular plasma membrane, a ring-shaped depletion focus is most commonly used to increase the lateral resolution, leading to more than 25-fold decrease in the observation volumee, reaching the relevant scale of supramolecular arrangements. However, STED-FCS faces severe limitations when measuring diffusion in three dimensions (3D), largely due to the spurious background contributions from undepleted areas of the excitation focus that reduce the signal quality and ultimately limit the resolution. In this paper, we investigate how different STED confinement modes can mitigate this issue. By simulations as well as experiments with fluorescent probes in solution and in cells, we demonstrate that the coherent-hybrid (CH) depletion pattern reduces background most efficiently and thus provides superior signal quality under comparable reduction of the observation volume. Featuring also the highest robustness to common optical aberrations, CH-STED can be considered the method of choice for reliable STED-FCS based investigations of 3D diffusion on the sub-diffraction scale.