High-spatial resolution urban scenes, which contain ecological, social and environmental zones, are composed of diverse functional areas. Depicting urban scenes from both qualitative and quantitative aspects, i.e., scene classification and scene unmixing, are increasingly important tasks. The purpose of scene classification is to mark a scene as a separate category, and this approach is widely applied to qualitatively define scenes. However, with the rapid development of cities, urban scenes usually show a mixed nature, resulting in urban mixed scenes (UMSs). Traditional scene classification cannot describe the mix of urban scenes. Few studies have focused on the quantitative measurements of UMSs, which aim at discovering and analyzing the proportion of different subscenes. In this paper, a scene unmixing framework based on nonnegative matrix factorization for UMSs (UnUMS) is proposed to estimate the mixing ratio of urban scenes. In the UnUMS, a deep feature representation is used for the high-dimensional representation of the image, and then a nonnegative matrix factorization strategy is used to estimate the proportion of UMSs. Finally, the UMS can be described by the proportion of each subscene. To confirm the robust generalizability of UnUMS, the WH-center Mix dataset with a set of small mixed scenes and the WH-hy LMix dataset with a large UMS image are utilized. The root-mean-square error and the newly proposed vector angular distance are utilized to quantitatively evaluate the effect of scene unmixing from different aspects. Experiments with the two datasets demonstrate that UnUMS can accurately estimate the proportions of different scenes for UMS.