Enhancing UAV Detection in Surveillance Camera Videos through Spatiotemporal Information and Optical Flow

Sun, Yu; Zhi, Xiyang; Han, Haowen; Jiang, Shikai; Shi, Tianjun; Gong, Jinnan; Zhang, Wei

doi:10.3390/s23136037

Cited by 9 publications

(5 citation statements)

References 47 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Detecting drones in surveillance footage is challenging due to their small size, low contrast, and bird similarity. To solve this problem, researchers propose using the following drone detection techniques: Deep machine learning [ 251 , 252 ]; Deep convolutional neural network (DC-CNN) (DC-CNN) [ 253 ]; Spatiotemporal information and optical flow [ 254 ]; Radio frequency (RF) [ 255 , 256 , 257 ]; Sensors that measure the sound emitted by the UAV [ 258 ]; The transformer network [ 259 ]; The “fisheye” camera system [ 260 ]. …”

Section: Resultsmentioning

confidence: 99%

Risks of Drone Use in Light of Literature Studies

Tubis,

Poturaj,

Dereń

et al. 2024

Sensors

View full text Add to dashboard Cite

This article aims to present the results of a bibliometric analysis of relevant literature and discuss the main research streams related to the topic of risks in drone applications. The methodology of the conducted research consisted of five procedural steps, including the planning of the research, conducting a systematic review of the literature, proposing a classification framework corresponding to contemporary research trends related to the risk of drone applications, and compiling the characteristics of the publications assigned to each of the highlighted thematic groups. This systematic literature review used the PRISMA method. A total of 257 documents comprising articles and conference proceedings were analysed. On this basis, eight thematic categories related to the use of drones and the risks associated with their operation were distinguished. Due to the high content within two of these categories, a further division into subcategories was proposed to illustrate the research topics better. The conducted investigation made it possible to identify the current research trends related to the risk of drone use and pointed out the existing research gaps, both in the area of risk assessment methodology and in its application areas. The results obtained from the analysis can provide interesting material for both industry and academia.

show abstract

Section: Resultsmentioning

confidence: 99%

Risks of Drone Use in Light of Literature Studies

Tubis,

Poturaj,

Dereń

et al. 2024

Sensors

View full text Add to dashboard Cite

show abstract

“…Although these methods have made some progress in enhancing the model's generalizability and dealing with complex monitoring environments, specific challenges unique to parking lot monitoring, such as extreme changes in light intensity and high-density target occlusion, still need to be addressed. Therefore, to further improve the recognition accuracy and real-time performance of parking lot monitoring systems, some studies have begun exploring new avenues, such as [50] introducing continuous image sequences and frame-to-frame optical flow processing methods to simulate human visual mechanisms and [42,51] aiming to enhance the detection capability for small moving targets by improving model structures and loss functions. These innovative methods have significantly improved the performance of monitoring models under specific conditions, but their universality and robustness in actual parking lot monitoring applications, especially in dealing with multi-target occlusion and extreme weather conditions in image capture, remain key issues for current research to explore in depth.…”

Section: Optimization Strategies For Object Detection Models In Compl...mentioning

confidence: 99%

CMCA-YOLO: A Study on a Real-Time Object Detection Model for Parking Lot Surveillance Imagery

Zhao,

Wang,

Yang

et al. 2024

Electronics

View full text Add to dashboard Cite

In the accelerated phase of urbanization, intelligent surveillance systems play an increasingly pivotal role in enhancing urban management efficiency, particularly in the realm of parking lot administration. The precise identification of small and overlapping targets within parking areas is of paramount importance for augmenting parking efficiency and ensuring the safety of vehicles and pedestrians. To address this challenge, this paper delves into and amalgamates cross-attention and multi-spectral channel attention mechanisms, innovatively designing the Criss-cross and Multi-spectral Channel Attention (CMCA) module and subsequently refining the CMCA-YOLO model, specifically optimized for parking lot surveillance scenarios. Through meticulous analysis of pixel-level contextual information and frequency characteristics, the CMCA-YOLO model achieves significant advancements in accuracy and speed for detecting small and overlapping targets, exhibiting exceptional performance in complex environments. Furthermore, the study validates the research on a proprietary dataset of parking lot scenes comprising 4502 images, where the CMCA-YOLO model achieves an mAP@0.5 score of 0.895, with a pedestrian detection accuracy that surpasses the baseline model by 5%. Comparative experiments and ablation studies with existing technologies thoroughly demonstrate the CMCA-YOLO model’s superiority and advantages in handling complex surveillance scenarios.

show abstract

“…Second, this method cannot identify drone targets larger than 32 × 32 pixels because it can only extract the edge motion features of the target, which can lead to the incorrect positioning of the target. In addition to the aforementioned approach, several researchers [15][16][17] have proposed utilizing multi-frame information to enhance model performance. However, these methods suffer from issues such as excessive computational steps or a substantial increase in calculations.…”

Section: Introductionmentioning

confidence: 99%

An Efficient Adjacent Frame Fusion Mechanism for Airborne Visual Object Detection

Ye,

Peng,

Liu

et al. 2024

Drones

View full text Add to dashboard Cite

With the continuous advancement of drone technology, drones are demonstrating a trend toward autonomy and clustering. The detection of airborne objects from the perspective of drones is critical for addressing threats posed by aerial targets and ensuring the safety of drones in the flight process. Despite the rapid advancements in general object detection technology in recent years, the task of object detection from the unique perspective of drones remains a formidable challenge. In order to tackle this issue, our research presents a novel and efficient mechanism for adjacent frame fusion to enhance the performance of visual object detection in airborne scenarios. The proposed mechanism primarily consists of two modules: a feature alignment fusion module and a background subtraction module. The feature alignment fusion module aims to fuse features from aligned adjacent frames and key frames based on their similarity weights. The background subtraction module is designed to compute the difference between the foreground features extracted from the key frame and the background features obtained from the adjacent frames. This process enables a more effective enhancement of the target features. Given that this method can significantly enhance performance without a substantial increase in parameters and computational complexity, by effectively leveraging the feature information from adjacent frames, we refer to it as an efficient adjacent frame fusion mechanism. Experiments conducted on two challenging datasets demonstrate that the proposed method achieves superior performance compared to existing algorithms.

show abstract

Enhancing UAV Detection in Surveillance Camera Videos through Spatiotemporal Information and Optical Flow

Cited by 9 publications

References 47 publications

Risks of Drone Use in Light of Literature Studies

Risks of Drone Use in Light of Literature Studies

CMCA-YOLO: A Study on a Real-Time Object Detection Model for Parking Lot Surveillance Imagery

An Efficient Adjacent Frame Fusion Mechanism for Airborne Visual Object Detection

Contact Info

Product

Resources

About