“…Weak Supervision. Weakly supervised learning has been extensively used for various problems in computer vision such as semantic segmentation [73,74,75,76,77,78], object localization [79,80,81,82], saliency detection [83,84], scene recognition [85,86] and many more. However, this form of learning has been relatively unexplored for crowd counting.…”