A 58.6 mW 30 Frames/s Real-Time Programmable Multiobject Detection Accelerator With Deformable Parts Models on Full HD $1920\times 1080$ Videos

Suleiman, Amr; Zhang, Zhengdong; Sze, Vivienne

doi:10.1109/jssc.2017.2648820

Cited by 14 publications

(2 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The fixed size of cells in the conventional solutions to generate the feature pyramid leads to the requirement of more calculations, including linear interpolation in previous works such as in Ref. 29. By contrast, the size ratio between cells of increasing size is used to define the scaling factor of the corresponding pyramid level in our work.…”

Section: Algorithm For Feature Space Constructionmentioning

confidence: 99%

See 1 more Smart Citation

Flexible feature-space-construction architecture and its VLSI implementation for multi-scale object detection

Luo

Zhang

et al. 2018

Jpn. J. Appl. Phys.

View full text Add to dashboard Cite

Feature extraction techniques are a cornerstone of object detection in computer-vision-based applications. The detection performance of visonbased detection systems is often degraded by, e.g., changes in the illumination intensity of the light source, foreground-background contrast variations or automatic gain control from the camera. In order to avoid such degradation effects, we present a block-based L1-norm-circuit architecture which is configurable for different image-cell sizes, cell-based feature descriptors and image resolutions according to customization parameters from the circuit input. The incorporated flexibility in both the image resolution and the cell size for multi-scale image pyramids leads to lower computational complexity and power consumption. Additionally, an object-detection prototype for performance evaluation in 65 nm CMOS implements the proposed L1-norm circuit together with a histogram of oriented gradients (HOG) descriptor and a support vector machine (SVM) classifier. The proposed parallel architecture with high hardware efficiency enables real-time processing, high detection robustness, small chipcore area as well as low power consumption for multi-scale object detection.

show abstract

Section: Algorithm For Feature Space Constructionmentioning

confidence: 99%

“…Multi-scale and multi-object detection has become a tendency for intelligent machine-vision applications. [27][28][29] However, a multi-scale image pyramid results in large data expansion and higher computational complexity, necessitating fast hardware implementations to enable realtime processing.…”

Section: Introductionmentioning

confidence: 99%

Flexible feature-space-construction architecture and its VLSI implementation for multi-scale object detection

Luo

Zhang

et al. 2018

Jpn. J. Appl. Phys.

View full text Add to dashboard Cite

show abstract

FPGA-based implementation of classification techniques: A survey

et al. 2021

View full text Add to dashboard Cite

VLSI Tree-Based Inference Design Applications for Low-Power Learning

Nagaraju¹,

Suresh²,

Uthayakumar³

et al. 2021

J. Phys.: Conf. Ser.

View full text Add to dashboard Cite

For the decision tree ensemble, this paper suggests a hardware architecture utilizing many feature channels. The proposed work uses the complexity of function channels for rapid identification compared to parallel processing in spatial domain scheduling to achieve conflict-free system memory. The results’ analysis demonstrates that only an FPGA implementation of the new architecture with a pedestrian sensor collated channel feature will conduct 229 thousand pulses per second at an operational value of 100 MHz while providing relatively limited resources. Checking estimation systems’ electricity-accuracy trade-offs has become central. This research evaluates the nature of data sets, investigating the outcomes of design difficulty or intensity of accuracy approximation. We improved the simulations’ precision by up to 6.7 percent by quantizing the inputs to small sizes relative to specific scenarios. The gap in model complexity was more important than source distance in terms of capacity, as we achieved reductions of up to 67 percent by reducing tree depth.

show abstract

A 58.6 mW 30 Frames/s Real-Time Programmable Multiobject Detection Accelerator With Deformable Parts Models on Full HD $1920\times 1080$ Videos

Cited by 14 publications

References 27 publications

Flexible feature-space-construction architecture and its VLSI implementation for multi-scale object detection

Flexible feature-space-construction architecture and its VLSI implementation for multi-scale object detection

FPGA-based implementation of classification techniques: A survey

VLSI Tree-Based Inference Design Applications for Low-Power Learning

Contact Info

Product

Resources

About