Nowadays, smart cities are a result of frequent technological advancements that improve the quality of life for their residents [1]. For smart cities, video surveillance is an important application in private and public sectors to monitor and protect areas at different scales which brings many problems and challenges, such as security and privacy of data that is collected from heterogenous sources [2].*Author for correspondence Furthermore, many video events, a lack of quality, a significant transmission latency for video data, and the loss of video surveillance data integrity [3]. Moreover, these systems still lack precision for realtime reactions that may be delayed. In addition, environmental variations are a constraint for any system. These obstacles are problematic for video surveillance systems. By analyzing and comprehending the recorded videos, the processing of data using several proposed algorithms may aid in avoiding these issues through analysis and comprehension [2].The smart city is a significant application of the internet of things (IoT) and is predicted to connect