2023
DOI: 10.1109/access.2023.3306951
|View full text |Cite
|
Sign up to set email alerts
|

Detection of Commodities Based on Multi-Feature Fusion and Attention Screening by Entropy Function Guidance

An Xie,
Kai Xie,
Hao-Nan Dong
et al.

Abstract: Although traditional convolutional neural networks (CNN) have been significantly improved for target detection, they cannot be completely applied to objects with occlusions in commodity detection. Therefore, we propose a target detection method based on an improved YOLOv5 model and an improved attention mechanism algorithm is proposed to solve the commodity occlusion problem. This method improves the traditional YOLO deep convolution network, features a more detailed BiFPN layer, and performs lightweight two-w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 46 publications
0
2
0
Order By: Relevance
“…X and Y denote the two input images, respectively, µ X , µ Y denotes the mean value of the two images, σ X , σ Y denote the standard deviation of the two images, respectively, σ XY denotes the covariance of X and Y, and C 1 , C 2 is a constant. Usually, taking C 1 (K 1 * L) 2 and C 2 (K 2 * L) 2 , in general, K 1 = 0.01, K 2 = 0.03, and L = 255 (dynamic range of pixel values). SSI M takes values ranging from 0 to 1.…”
Section: Evaluation Indexmentioning
confidence: 99%
See 1 more Smart Citation
“…X and Y denote the two input images, respectively, µ X , µ Y denotes the mean value of the two images, σ X , σ Y denote the standard deviation of the two images, respectively, σ XY denotes the covariance of X and Y, and C 1 , C 2 is a constant. Usually, taking C 1 (K 1 * L) 2 and C 2 (K 2 * L) 2 , in general, K 1 = 0.01, K 2 = 0.03, and L = 255 (dynamic range of pixel values). SSI M takes values ranging from 0 to 1.…”
Section: Evaluation Indexmentioning
confidence: 99%
“…When performing merchandise detection in vision cabinets [2,3], it is usually necessary to feed the merchandise picked up by the consumer into the target detection network for detection. However, in this process, the consumer's rapid picking up of the merchandise can lead to motion blurring of the merchandise under fixed background conditions.…”
Section: Introductionmentioning
confidence: 99%