A multiclass 3D object recognition has perceived a numerous evolution with respect to both accuracy and speed. This study introduces the implementation of modern YOLO algorithms (YOLOv3, YOLOv4, and YOLOv5) for multiclass 3D object detection and recognition. All YOLO algorithms have been tested according to a very large scaled dataset (Pascal VOC dataset). Performance evaluation has targeted the calculation of the following metrics; mAP (mean average precision), recall, F1-score, IOU (intersection over union), and the running time. Experimental results demonstrate that the YOLOv3 has targeted mAP of 77%, IOU of 0.41 and the total running time was almost 8 h. Moreover, in YOLOv4, it has targeted mAP of 55%, IOU of 0.035 and the total running time nearly 7 h. In addition, YOLOv5 has established the mAP of 48%, IOU of 0.045, and the total running time was about 3 h. Finally, a modified version of YOLOv5 has been proposed in the state-of-the-art of optimizing its hyperparameters and layering system. Accordingly, the mAP scored about 55% with 3 h running time. The final conclusions of this study have demonstrated that YOLOv3 has scored the highest recognition accuracy, however, the proposed modified YOLOv5 has scored the least processing time.
Human activity recognition has become an expansive field of interest in recent years, both in academic and industrial research. Human Activity recognition (HAR) is concerning the prediction of person's movement or action such as walking, standing, sitting, up and downstairs, etc. Convolutional neural network (CNN) is a key component with deep learning. The main objective of this study is to design and implement an activity recognition algorithm in the state of the art of deep learning systems which accomplish superlative performance to detect human activities. Accordingly, an extensive comparison has been developed between different deep learning algorithms such as classical (CNN) models and Recurrent Neural Network (RNN) models with respect to the major human activities. Furthermore, a robust (CNN) deep learning model has been built up and proposed in order to enhance the recognition precision of human activities. This proposed model uses raw data acquired from a set of inertial sensor and exploring numerous human achas been built up and proposedtivities; sitting, standing, jogging, walking, and etc. Experimental results show that the precision of the proposed deep structure has achieved 97.5% with respect to the NAdam optimizer which would be considered as the most effectively recognizer compared to other deep learning architectures.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.