“…Similarly, in order to infer the accurate location of the target object, a tracker needs to take changes of several appearance (illumination change, blurriness, occlusion) and dynamic (expanding, shrinking, aspect ratio change) properties into account. Although visual tracking research has achieved remarkable advances in the past decades [21-23, 32, 38-40], and thanks to deep learning especially in the recent years [6,8,29,35,36,41], most methods employ only a subset of these properties, or are too slow to perform in real-time.…”