2021
DOI: 10.1186/s12911-021-01691-8
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of RetinaNet, SSD, and YOLO v3 for real-time pill identification

Abstract: Background The correct identification of pills is very important to ensure the safe administration of drugs to patients. Here, we use three current mainstream object detection models, namely RetinaNet, Single Shot Multi-Box Detector (SSD), and You Only Look Once v3(YOLO v3), to identify pills and compare the associated performance. Methods In this paper, we introduce the basic principles of three object detection models. We trained each algorithm o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
28
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 119 publications
(46 citation statements)
references
References 22 publications
1
28
0
Order By: Relevance
“…Additionally, YOLOv4-tiny has a higher FPS, which leads to faster performance [24,33]. Due to the fact that YOLOv4-tiny is a modified version of YOLOv3, its accuracy increased [34], and YOLOv3 already outperformed SSD and faster R-CNN [35]. A mobile client-server application for price tag data verification was also developed based on the study's results.…”
Section: Discussionmentioning
confidence: 99%
“…Additionally, YOLOv4-tiny has a higher FPS, which leads to faster performance [24,33]. Due to the fact that YOLOv4-tiny is a modified version of YOLOv3, its accuracy increased [34], and YOLOv3 already outperformed SSD and faster R-CNN [35]. A mobile client-server application for price tag data verification was also developed based on the study's results.…”
Section: Discussionmentioning
confidence: 99%
“…For example, YOLO is considered among the fastest object recognition methods. However, R-CNN is often more accurate; for technical comparison between these methods, the reader is referred to [40] . But overall, this progress in the development of object recognition methods has also helped solve some complex CV tasks such as detecting and tracking objects of interest in video footage [41][42][43] , which is an important and very common task in robotic surgery.…”
Section: Figurementioning
confidence: 99%
“…SSD has improved versions such as Deconvolutional SSD (DSSD) that includes large-scale context in object detection, Rainbow SSD (RSSD) that concatenates different feature maps using deconvolution and batch normalisation [ 73 ], and Feature-fusion SSD (FSSD) that balances semantic and positional information using bilinear interpolation to resize feature maps to the same size to be subsequently concatenated [ 74 ]. The comparison of different architectures for real-time applications presented in [ 75 ] also mentions RetinaNet because it has higher accuracy, but it is not recommended for real-time applications, as it has a frame rate lower than 25 frames per second (FPS). EdgeEye [ 76 ] proposes an edge computing framework to analyse real-time video with a mean of 55 FPS as inference speed.…”
Section: Vision Modulementioning
confidence: 99%