“…Additionally, spatial-temporal context [86] and kernel tricks [27] are used to improve the learning formulation with the consideration of local appearance and nonlinear metric, respectively. The DCF paradigm has further been extended by exploiting scale detection [41,14,16], structural patch analysis [42,46,45], multi-clue fusion [71,50,28,4,72], sparse representation [88,90], support vector machine [75,92], enhanced sampling mechanisms [89,54] and end-to-end deep neural networks [73,67].…”