Object Detection Algorithm for High Resolution Images Based on Convolutional Neural Network and Multiscale Processing

Bohush, Rykhard; Ablameyko, Sergey; Ihnatsyeva, S. A.; Adamovskiy, Yahor

doi:10.32782/cmis/2864-12

Cited by 6 publications

(2 citation statements)

References 9 publications

“…It has a small model magnitude, a rapid detection speed, and is better suited to being promoted to some edge or mobile end devices. The technique proposed by Bohusha et al [40] for recognizing objects in 4K and 8K images and it has a great efficiency in recognizing small objects in 4K and 8K quality images. Kadadi et al [41] shows how to use the background subtraction (BGS) method to find and follow the intended moving objects (MOs).…”

Section: Research Contributions (mentioning)

confidence: 99%

Object Classification and Tracking Using Scaled P8 YOLOv4 Lite Model

Shaikh, Chopade, Kharate (2023). Period. Polytech. Elec. Eng. Comp. Sci.


One of the most difficult tasks in computer vision is object detection, which combines object categorization and object localization within a scene. Deep neural networks have recently been shown to outperform alternative approaches to object detection. Their drawback is high complexity and computational cost, so objects in high-resolution images cannot be detected and tracked in real time. We propose a scaled YOLOv4 lite model as a single-stage detector neural network for object detection and tracking, trained on the COCO 2017 dataset. To create the YOLOv4-CSP-P5-P6-P7-P8 networks, Scaled YOLOv4 applies efficient network scaling strategies. An additional layer, P8, is added to the YOLOv4 lite model, which improves accuracy. Cross-stage-partial (CSP) connections and Mish activation are used in the improved network design, such as backbone optimization and the neck (PAN). In the case of YOLOv4, however, it can only be trained once for all resolutions. Width and height activations have been changed, allowing for faster network training. The YOLOv4 lite model uses CSPDarkNet-53 as its backbone. The experimental results show that our YOLOv4 lite model can detect and track objects at up to 28 fps on video input and has an accuracy of 86.09% when tested on real-time video at a resolution of 1920 × 1080 (full HD). With the CSPDarkNet-53 backbone, AP = 50.81%, AP@50 = 63.6%, and AP@75 = 52.5%.
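The AP@50 and AP@75 figures above follow the COCO convention, in which a predicted box counts as a correct detection only if its intersection over union (IoU) with a ground-truth box reaches 0.50 or 0.75 respectively. The sketch below only illustrates that matching criterion; it is not the evaluation code of the cited paper, and the function names `iou` and `is_true_positive` are hypothetical. Boxes are assumed to be `(x0, y0, x1, y1)` tuples in pixel coordinates.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1), assumed format


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ax0, ay0, ax1, ay1 = a
    bx0, by0, bx1, by1 = b
    # Width and height of the overlap region (zero if the boxes are disjoint).
    iw = max(0.0, min(ax1, bx1) - max(ax0, bx0))
    ih = max(0.0, min(ay1, by1) - max(ay0, by0))
    inter = iw * ih
    union = (ax1 - ax0) * (ay1 - ay0) + (bx1 - bx0) * (by1 - by0) - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred: Box, gt: Box, threshold: float = 0.5) -> bool:
    """Hypothetical helper: does `pred` match `gt` at the given IoU threshold?"""
    return iou(pred, gt) >= threshold
```

A prediction shifted slightly off its ground-truth box can pass the 0.50 threshold yet fail the 0.75 one, which is why AP@75 is the lower of the two reported numbers.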


“…Large objects in the input image can be detected, but small objects are difficult to detect because the characteristic parts for identifying the objects are also shrunk. Dividing the input image into several parts of a limited size can also be done to prevent shrinkage of the characteristic parts [21,22,23,24,25,26], but this means large objects that straddle the divided images cannot be detected because the characteristic parts are also divided. As another approach, a coarse-to-fine-based inference scheme for object detection has been proposed [27,28].…”

Section: Introduction (mentioning)

confidence: 99%

High-definition technology of AI inference scheme for object detection on edge/terminal

Uzawa, Yoshida, Iinuma, et al. (2023). IEICE Electron. Express


To detect a wide range of objects with one camera at once, real-time object detection in high-definition video is required in video artificial intelligence (AI) applications for edge/terminal devices, such as beyond-visual-line-of-sight (BVLOS) drone flight. Although various AI inference schemes for object detection (e.g., you-only-look-once (YOLO)) have been proposed, they typically limit the input image size and thus need to shrink the high-definition input image down to that limit. This collapses small objects and makes them undetectable. This paper presents our proposed technology for solving this problem and its effective implementation, in which multiple object detectors cooperate to detect small and large objects in high-definition video such as full HD and 4K.
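The image-division approach mentioned in the citation statement above can be sketched as a tile generator. This is a minimal illustration, not the implementation of any cited paper: the function name `tile_boxes`, the 1024-pixel tile size, and the 128-pixel overlap are assumptions chosen for the example. Overlap reduces, but does not remove, the problem of objects that straddle tile borders.

```python
from typing import Iterator, List, Tuple


def tile_boxes(width: int, height: int, tile: int = 1024,
               overlap: int = 128) -> Iterator[Tuple[int, int, int, int]]:
    """Yield (x0, y0, x1, y1) tiles that together cover a width x height image."""
    if not 0 <= overlap < tile:
        raise ValueError("overlap must satisfy 0 <= overlap < tile")
    stride = tile - overlap

    def starts(size: int) -> List[int]:
        # An axis no longer than one tile needs a single tile.
        if size <= tile:
            return [0]
        # Regular grid positions, plus a final tile flush with the border.
        positions = list(range(0, size - tile, stride))
        positions.append(size - tile)
        return positions

    for y0 in starts(height):
        for x0 in starts(width):
            yield (x0, y0, min(x0 + tile, width), min(y0 + tile, height))


# Example: tiles for a 4K (3840 x 2160) frame.
tiles = list(tile_boxes(3840, 2160))
```

Each tile would be passed to the detector at native resolution, and the resulting boxes shifted by `(x0, y0)` back into full-image coordinates. Large objects spanning several tiles still need a separate pass on a downscaled copy of the frame, which is the limitation the citation statement points out.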
