“…The bounded gradient assumption is a common assumption for the analysis of DP-SGD algorithms , Zhou et al, 2020a and also frequently used in general adaptive gradient methods such as Adam [Reddi et al, 2021, Chen et al, 2018, Reddi et al, 2018. One recent popular approach to relax this assumption is using the gradient clipping method [Chen et al, 2020, Andrew et al, 2019, Pichapati et al, 2019, which we will discuss more in Section 6 as well as in Appendix A.…”