“…Beyond learning with less supervision, most recent works [45,44,54,1,23,29,38,48,37,42] focus on object localization using self-supervised or unsupervised learning, which requires no human-annotated labels. These works address the problem of identifying which regions of an image are more likely to contain the foreground object, that is, the salient object in the image.…”