“…Visual object tracking is an important topic in computer vision, where the target object is identified in the first frame and tracked in all frames of a video. Due to the significant learning ability, deep convolutional neural networks (DCNNs) have been widely used to object detection [35,34,62], image matting [43,42,64], super-resolution [68,67,63], image enhancement [61,65] and visual object tracking [2,15,19,28,32,72,38,13,70,71,22,12,11,33,58,47]. However, RGB-based trackers suffer from bad environmental conditions, e.g., low illumination, fast motion, and so on.…”