2023
DOI: 10.3390/math11030682

Recent Advances in Stochastic Gradient Descent in Deep Learning

Abstract: In the age of artificial intelligence, finding the best approach to handling huge amounts of data is a tremendously motivating and hard problem. Among machine learning optimization methods, stochastic gradient descent (SGD) is not only simple but also very effective. This study provides a detailed analysis of contemporary state-of-the-art deep learning applications, such as natural language processing (NLP), visual data processing, and voice and audio processing. Following that, this study introduces several variants of SGD and its…
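For reference (standard textbook form, not quoted from the abstract), the basic update that these SGD variants build on is

$\theta_{t+1} = \theta_t - \eta\,\nabla_\theta \ell(\theta_t; x_{i_t}, y_{i_t})$,

where $(x_{i_t}, y_{i_t})$ is the minibatch sampled at step $t$ and $\eta$ is the learning rate.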

Cited by 72 publications (31 citation statements)
References 47 publications
“…The learning rate was 0.005. Optimisation utilised the stochastic gradient descent method (Tian et al., 2023), as this is computationally faster and can converge more quickly than other optimisation algorithms. The loss function used was cross-entropy loss.…”
Section: Methods
confidence: 99%
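A minimal sketch of the training setup described in this excerpt, assuming a PyTorch workflow; the model, input size, and dummy minibatch are placeholders, and only the optimizer choice, the learning rate of 0.005, and the cross-entropy loss come from the quoted text.

import torch
import torch.nn as nn

# Placeholder model and data; only the optimizer, lr, and loss follow the excerpt.
model = nn.Linear(128, 10)                         # stand-in classifier
optimizer = torch.optim.SGD(model.parameters(), lr=0.005)
criterion = nn.CrossEntropyLoss()

inputs = torch.randn(32, 128)                      # dummy minibatch
targets = torch.randint(0, 10, (32,))              # dummy class labels

optimizer.zero_grad()
loss = criterion(model(inputs), targets)           # cross-entropy loss
loss.backward()                                    # backpropagate gradients
optimizer.step()                                   # SGD parameter update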
“…To better understand the REMR algorithm and its main steps, its pseudo-code is introduced in Algorithm 1. REMR does not really include an approximation process by hyperparameter optimization, which is usually performed via gradient descent algorithms; this is more like the submerging of deep networks [11]. Therefore, the traditional equations of the loss function will not be useful in evaluating its convergence behavior at every round.…”
Section: $\tilde{y}_{k+1} = f(x_{k+1}),\; k = 1 \to m$
confidence: 99%
“…Since the advent of deep learning and neural networks, there have been studies of numerical optimization algorithms from the perspective of machine learning. In particular, first-order algorithms, like gradient descent and its variants, have been widely used in machine learning and data analysis research [1,29,37,39,55,59,61]. While first-order algorithms benefit from being memory efficient, low-cost per iteration, and simple to implement, they are also notoriously difficult to fine-tune and slow to converge, especially when the functions are not well-conditioned.…”
Section: Related Work
confidence: 99%
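To illustrate the conditioning point made in this excerpt, here is a small numerical sketch (not taken from the cited works): plain fixed-step gradient descent on a two-dimensional quadratic whose Hessian has condition number 100. The matrix, step size, and iteration count are arbitrary choices for illustration.

import numpy as np

def gradient_descent(A, x0, lr, steps):
    """Minimize f(x) = 0.5 * x^T A x with fixed-step gradient descent."""
    x = x0.copy()
    for _ in range(steps):
        x = x - lr * (A @ x)   # gradient of f is A x
    return x

kappa = 100.0                  # condition number of the quadratic
A = np.diag([1.0, kappa])
x0 = np.array([1.0, 1.0])

# A step size of 1/kappa keeps the steep direction stable, but progress along
# the flat direction shrinks only by a factor of (1 - 1/kappa) per step.
x = gradient_descent(A, x0, lr=1.0 / kappa, steps=500)
print(np.linalg.norm(x))       # still roughly 7e-3 after 500 steps

The slow decay along the small-eigenvalue direction is exactly the "not well-conditioned" behavior the excerpt refers to, and it motivates the preconditioned and adaptive variants surveyed in the cited paper.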