“…For SVOS methods, the target object(s) is provided in the first frame and tracked automatically [60,8,5,68,2,69,64,71] or interactively by users [1] in the subsequent frames. Numerous algorithms were proposed based on graphical models [54], object proposals [46], supertrajectories [61], etc.…”