2022
DOI: 10.3390/app122311976

On the Relative Impact of Optimizers on Convolutional Neural Networks with Varying Depth and Width for Image Classification

Abstract: The continued increase in computing resources is one key factor allowing deep learning researchers to scale, design, and train new and complex convolutional neural network (CNN) architectures of varying width, depth, or both to improve performance on a variety of problems. The contributions of this study include an uncovering of how different optimization algorithms impact CNN architectural setups with variations in width, depth, and both width/depth. Specifically, in this study…
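To make the notion of varying a CNN's depth and width concrete, here is a minimal Keras sketch; the helper name, layer choices, and default values are illustrative assumptions, not the architectures evaluated in the paper:

```python
# Minimal sketch (not the paper's exact architectures): a CNN whose
# depth (number of conv blocks) and width (filters per block) are parameters.
import tensorflow as tf

def build_cnn(depth=3, width=32, num_classes=10, input_shape=(32, 32, 3)):
    """Stack `depth` conv blocks, each `width` filters wide (assumed setup)."""
    model = tf.keras.Sequential([tf.keras.layers.Input(shape=input_shape)])
    for _ in range(depth):
        model.add(tf.keras.layers.Conv2D(width, 3, padding="same", activation="relu"))
        model.add(tf.keras.layers.MaxPooling2D())
    model.add(tf.keras.layers.Flatten())
    model.add(tf.keras.layers.Dense(num_classes, activation="softmax"))
    return model

# Example: a deeper variant and a wider variant of the same base network.
deeper = build_cnn(depth=5, width=32)
wider = build_cnn(depth=3, width=128)
```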

Cited by: 9 publications (6 citation statements)
References: 34 publications
“…This suggests that the extracted features and the classifier were invariant to the internal implementation and thus suitable for the task. It is worth pointing out that, on the contrary, selecting different loss functions and optimizers in a CNN-based DL model can vastly affect its performance [21,22]. Finally, automated kernel scaling was activated for all kernels.…”
Section: Results
Citation type: Mentioning (confidence: 99%)
“…Among the optimizers most commonly used in prediction and classification tasks, SGD, RMSprop, Adadelta, and Adam were selected for study. We therefore expanded our analysis using the Adam, Adadelta, RMSprop, and SGD optimizers [10,27–29,33–36]. We considered the merits of each optimizer, including the SGD [33], RMSprop [29], Adam [37], and Adadelta [29] formulae.…”
Section: Optimizer, Learning Rate, and Batch Size
Citation type: Mentioning (confidence: 99%)
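The kind of optimizer comparison the citing authors describe could be sketched as follows; the model, dataset (CIFAR-10), learning rates, and epoch count are assumptions for illustration only, not values taken from the cited works:

```python
# Sketch: train the same small (assumed) CNN under the four optimizers named
# above and compare final validation accuracy; hyper-parameters are illustrative.
import tensorflow as tf

def make_model():
    # Small illustrative CNN, not the architectures from the paper.
    return tf.keras.Sequential([
        tf.keras.layers.Input(shape=(32, 32, 3)),
        tf.keras.layers.Conv2D(32, 3, padding="same", activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(64, 3, padding="same", activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Flatten(),
        tf.keras.layers.Dense(10, activation="softmax"),
    ])

optimizers = {
    "SGD":      tf.keras.optimizers.SGD(learning_rate=1e-2, momentum=0.9),
    "RMSprop":  tf.keras.optimizers.RMSprop(learning_rate=1e-3),
    "Adadelta": tf.keras.optimizers.Adadelta(learning_rate=1.0),
    "Adam":     tf.keras.optimizers.Adam(learning_rate=1e-3),
}

(x_train, y_train), (x_test, y_test) = tf.keras.datasets.cifar10.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

results = {}
for name, opt in optimizers.items():
    model = make_model()
    model.compile(optimizer=opt,
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    history = model.fit(x_train, y_train, epochs=5, batch_size=128,
                        validation_data=(x_test, y_test), verbose=0)
    results[name] = history.history["val_accuracy"][-1]

print(results)  # final validation accuracy per optimizer
```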
“…P is an argument responsible for keeping the spatial sizes fixed after the convolution operation by adding rows and columns of zero values. P has two settings: valid (no padding) and same (zero padding) [16,17]. Moreover, D is a hyper-parameter that adjusts the moving averages.…”
Section: SSD's Hyper-parameters
Citation type: Mentioning (confidence: 99%)
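The valid-versus-same padding behaviour mentioned in this statement can be illustrated with a short Keras sketch; the framework, kernel size, and input shape are chosen purely for illustration and are not taken from the cited work:

```python
# Sketch: 'same' padding adds rows/columns of zeros so the spatial size is
# preserved, while 'valid' applies no padding and shrinks the output.
import tensorflow as tf

x = tf.random.normal([1, 28, 28, 3])                      # one 28x28 RGB image

same = tf.keras.layers.Conv2D(8, 5, padding="same")(x)    # zero padding
valid = tf.keras.layers.Conv2D(8, 5, padding="valid")(x)  # no padding

print(same.shape)   # (1, 28, 28, 8)  -> spatial size unchanged
print(valid.shape)  # (1, 24, 24, 8)  -> 28 - 5 + 1 = 24
```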