“…In recent years, with the advancement of deep learning and object detection, online tracking has attracted more and more attention. On contrary to offline methods, online methods usually adopt the Hungarian algorithm for data association, but focus on the joint learning of object detection and some useful priors, such as object motions [1], [7], [35], appearance features [4], [36], [37], occlusion maps [36], object poses [38] and so on. However, except for the annotation of box and category ID, extra annotations are required for the learning of these priors, e.g., object identity for appearance feature learning.…”