“…Recently, deep neural networks [4,6,10,32,35,41,44,58,68,71,72,76] have become mainstream in the task of crowd counting and have made remarkable progress. To acquire better performance, most of the state-of-the-art methods [13,28,31,36,40,62,66] utilized heavy backbone networks (such as the VGG model [56]) to extract features.…”