Vision tracking is a key component of a video sequence. It is the process of locating single or multiple moving objects over time using one or many cameras. The latter's function consists of detecting, categorizing, and tracking. The development of the trustworthy solution for video sequence analysis opens up new horizons for a variety of applications, including intelligent transportation systems, biomedical, agriculture, human-machine interaction, augmented reality, video surveillance, robots, and many crucial research areas. To make efficient models, there are challenges in video observation to deal with, such as problems with the environment, light variation, pose variation, motion blur, clutter, occlusion, and so on. In this paper, we present several techniques that addressed the issues of detecting and tracking multiple targets on video sequences. The proposed comparative study relied on different methodologies. This paper's purpose is to list various approaches, classify them, and compare them, using the Weighted Scoring Model (WSM) comparison method. This includes studying these algorithms, selecting relevant comparison criteria, assigning weights for each criterion, and lastly computing scores. The obtained results of this study will reveal the strong and weak points of each algorithm mentioned and discussed.