Nonlinear domain decomposition (DD) methods, such as, e.g., ASPIN (Additive Schwarz Preconditioned Inexact Newton), RASPEN (Restricted Additive Schwarz Preconditioned Inexact Newton), Nonlinear-FETI-DP, or Nonlinear-BDDC methods, can be reasonable alternatives to classical Newton-Krylov-DD methods for the solution of sparse nonlinear systems of equations, e.g., arising from a discretization of a nonlinear partial di↵erential equation. These nonlinear DD approaches are often able to e↵ectively tackle unevenly distributed nonlinearities and outperform Newton's method with respect to convergence speed as well as global convergence behavior. Furthermore, they often improve parallel scalability due to a superior ratio of local to global work. Nonetheless, as for linear DD methods, it is often necessary to incorporate an appropriate coarse space in a second level to obtain numerical scalability for increasing numbers of subdomains. In addition to that, an appropriate coarse space can also improve the nonlinear convergence of nonlinear DD methods. In this paper, four variants how to integrate coarse spaces in nonlinear Schwarz methods in an additive or multiplicative way using Galerkin projections are introduced. These new variants can be interpreted as natural nonlinear equivalents to well-known linear additive and hybrid twolevel Schwarz preconditioners. Furthermore, they facilitate the use of various coarse spaces, e.g., coarse spaces based on energy-minimizing extensions, which can easily be used for irregular domain decompositions, as, e.g., obtained by graph partitioners. In particular, Multiscale Finite Element Method (MsFEM) type coarse spaces are considered, and it is shown that they outperform classical approaches for certain heterogeneous nonlinear problems. The new approaches are then compared with classical Newton-Krylov-DD and nonlinear onelevel Schwarz approaches for di↵erent homogeneous and heterogeneous model problems based on the p-Laplace operator.