“…Visual object tracking (VOT) has emerged as a dynamic study area due to its utilization in a wide range of applications such as human action recognition [ 1 , 2 , 3 ], traffic monitoring [ 4 , 5 ], pellet ore phase [ 6 ], smart city [ 7 ], embedded system [ 8 ], surveillance [ 9 , 10 , 11 ] and medical diagnosis [ 12 , 13 ]. While significant progress has been made in recent years, accurate estimation for tracking an object is still a challenge in a video sequence due to various factors such as scale variations, occlusion, deformation, background clutters, to name a few [ 14 , 15 , 16 ].…”