2023
DOI: 10.1109/tip.2023.3268561
|View full text |Cite
|
Sign up to set email alerts
|

Neural Attention-Driven Non-Maximum Suppression for Person Detection

Abstract: Non-maximum suppression (NMS) is a postprocessing step in almost every visual object detector. NMS aims to prune the number of overlapping detected candidate regionsof-interest (ROIs) on an image, in order to assign a single and spatially accurate detection to each object. The default NMS algorithm (GreedyNMS) is fairly simple and suffers from severe drawbacks, due to its need for manual tuning. A typical case of failure with high application relevance is pedestrian/person detection in dense human crowds, wher… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 44 publications
0
3
0
Order By: Relevance
“…Soft-NMS is an improved loss function used to suppress redundant bounding boxes more smoothly during the NMS process [30]. Previously, the traditional NMS method used a fixed threshold to determine if two bounding boxes overlap and to suppress them [31]. However, due to the fixed threshold, it may adapt to situations between different objects, resulting in some candidate boxes with low confidence but which incorrectly exclude significantly overlapping true objects.…”
Section: Soft-nms Loss Functionmentioning
confidence: 99%
“…Soft-NMS is an improved loss function used to suppress redundant bounding boxes more smoothly during the NMS process [30]. Previously, the traditional NMS method used a fixed threshold to determine if two bounding boxes overlap and to suppress them [31]. However, due to the fixed threshold, it may adapt to situations between different objects, resulting in some candidate boxes with low confidence but which incorrectly exclude significantly overlapping true objects.…”
Section: Soft-nms Loss Functionmentioning
confidence: 99%
“…When predicting the signal positions for candidate regions, the model generates multiple prediction boxes, which inevitably raises the question of how to eliminate redundant predictions. Most approaches rely on post-processing steps, such as non-maximum suppression (NMS) [8] or its related improved algorithms [9], to filter and retain the most probable box among multiple potential predictions for the same target by setting a threshold. However, there is significant variation in the aspect ratios of signals in the spectrogram, making it challenging for well-designed prior anchors to match different signals.…”
Section: Introductionmentioning
confidence: 99%
“…The input plane receives images of characters with a standardized size and centeredness, and each layer's units receive input from a set of smaller units located in the previous layer [11]. Two unique attributes of the architecture of CNNs are their sparse connections and shared weights [12]. CNNs exploit spatially local correlation by imposing a pattern of local connectivity between neurons of adjacent layers, where units in layer m are connected to three spatially adjacent units in layer m − 1.…”
Section: Introductionmentioning
confidence: 99%