2021
DOI: 10.3390/mi12121504

Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines

Abstract: Deep neural networks are widely used in the field of image processing for micromachines, such as in 3D shape detection in microelectronic high-speed dispensing and object detection in microrobots. It is already known that hyperparameters and their interactions impact neural network model performance. Taking advantage of the mathematical correlations between hyperparameters and the corresponding deep learning model to adjust hyperparameters intelligently is the key to obtaining an optimal solution from a deep n…

Cited by 8 publications (6 citation statements) · References 28 publications
“…The head_dim is the dimensionality of the multi-head attention-based hidden layer of the model, which affects the performance and computational cost of the model when the number of heads is a fixed value (generally 8). Additionally, we use dropout in the multi-head attention layer to reduce the multi-head attention weights, which reduces overfitting and improves the running speed of the model, and we set the parameter of the dropout function to 0.3-0.5 with reference to the literature [57]. Therefore, to obtain better entity recognition performance from the FLAT model, we conducted exploratory experiments on the optimization algorithm, learning rate (lr), number of model encoding layers, and head_dim with the CLUENER2020 dataset.…”
Section: FLAT Optimization With SGDM and Hyperparameter (mentioning)
confidence: 99%
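The configuration described in this statement (8 attention heads, a tunable head_dim, and attention dropout in the 0.3-0.5 range) maps directly onto a standard multi-head attention layer. Below is a minimal PyTorch sketch, not the citing authors' code; the class name and default values are illustrative, with head_dim and the dropout rate exposed as the hyperparameters being tuned.

```python
import torch
import torch.nn as nn

class TunableMultiHeadAttention(nn.Module):
    """Hypothetical sketch: head_dim and attention dropout as hyperparameters."""

    def __init__(self, head_dim: int = 64, num_heads: int = 8, dropout: float = 0.3):
        super().__init__()
        embed_dim = head_dim * num_heads  # hidden size scales with head_dim
        self.attn = nn.MultiheadAttention(
            embed_dim=embed_dim,
            num_heads=num_heads,   # generally fixed at 8, per the statement
            dropout=dropout,       # dropout applied to the attention weights
            batch_first=True,
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        out, _ = self.attn(x, x, x)  # self-attention over the input sequence
        return out

# Example: batch of 2 sequences, length 10, hidden size 8 * 64 = 512
layer = TunableMultiHeadAttention(head_dim=64, num_heads=8, dropout=0.4)
x = torch.randn(2, 10, 8 * 64)
print(layer(x).shape)  # torch.Size([2, 10, 512])
```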
“…Unlike the original FLAT model, we find the optimal model by experimenting with optimizers such as SGD and Adam and with different parameters (layers, lr, head_dim); the experimentally derived optimal model is then compared with classic NER methods to verify the superiority of the resulting model. Additionally, the parameter of the dropout function is set to 0.3-0.5 with reference to the literature [57], and the multi-head attention matrix is changed to a sparse matrix by using dropout in the multi-head attention layer to improve the running speed of our model. Other parameter settings are derived from the literature [52].…”
Section: Comparison Experiments With the Baseline Models (mentioning)
confidence: 99%
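A minimal sketch of the kind of exploratory sweep described in this statement, assuming a grid over optimizer choice (SGD vs. Adam), learning rate, encoder depth, and head_dim. The train_and_eval function is a hypothetical stand-in for training FLAT on CLUENER2020 and returning a validation F1 score; here it returns a random number so the sketch runs end to end.

```python
import itertools
import random

def train_and_eval(optimizer, lr, num_layers, head_dim):
    # Stand-in for training the model and returning a validation F1 score;
    # replaced by a random number so the sketch is self-contained.
    return random.random()

# Illustrative search space; the values are not taken from the cited work.
search_space = {
    "optimizer": ["sgd", "adam"],
    "lr": [1e-3, 5e-4, 1e-4],
    "num_layers": [1, 2, 4],
    "head_dim": [16, 20, 32],
}

best_score, best_config = -1.0, None
for combo in itertools.product(*search_space.values()):
    config = dict(zip(search_space.keys(), combo))
    score = train_and_eval(**config)
    if score > best_score:
        best_score, best_config = score, config

print("best score:", best_score)
print("best config:", best_config)
```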
“…It provides an overview of various CNN approaches utilized for image classification, segmentation, and styling. Next, in [19], the mathematical relationships among four hyperparameters, namely the learning rate, batch size, dropout rate, and convolution kernel size, were investigated in detail. A generalized multi-parameter mathematical correlation approach was derived, showing that these hyperparameters play a vital part in the efficiency of NN models.…”
Section: Related Work (mentioning)
confidence: 99%
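As a concrete illustration of the four hyperparameters named in this statement, the sketch below wires the learning rate, batch size, dropout rate, and convolution kernel size into a small CNN training loop in PyTorch. It is an assumed toy example with placeholder values and dummy data, not the experimental setup of [19].

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# The four hyperparameters studied in [19]; the values here are placeholders.
lr = 1e-3            # learning rate
batch_size = 32      # batch size
dropout_rate = 0.4   # dropout rate
kernel_size = 3      # convolution kernel size

model = nn.Sequential(
    nn.Conv2d(1, 16, kernel_size=kernel_size, padding=kernel_size // 2),
    nn.ReLU(),
    nn.Dropout(dropout_rate),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(16, 10),
)

optimizer = torch.optim.SGD(model.parameters(), lr=lr)
loss_fn = nn.CrossEntropyLoss()

# Dummy data standing in for a real image dataset.
data = TensorDataset(torch.randn(256, 1, 28, 28), torch.randint(0, 10, (256,)))
loader = DataLoader(data, batch_size=batch_size, shuffle=True)

for images, labels in loader:
    optimizer.zero_grad()
    loss = loss_fn(model(images), labels)
    loss.backward()
    optimizer.step()
```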
“…A dense-layer neuron conducts matrix-vector multiplication over the inputs from the neurons in the previous layer. As indicated in Equation (7), the usual formula for matrix-vector multiplication is as follows [37]: …”
Section: DenseNet-169 For Spectrogram Classification (mentioning)
confidence: 99%
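The quoted equation itself is truncated in the excerpt above. For reference only, and not necessarily the exact Equation (7) of the citing paper, the standard dense-layer matrix-vector form can be written as:

```latex
y_j = f\left( \sum_{i=1}^{n} w_{ji}\, x_i + b_j \right),
\qquad \text{or equivalently} \qquad
\mathbf{y} = f\left( W\mathbf{x} + \mathbf{b} \right),
```

where x is the output vector of the previous layer, W and b are the dense layer's weights and biases, and f is the activation function.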