2022
DOI: 10.48550/arxiv.2205.09121
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

On the efficiency of Stochastic Quasi-Newton Methods for Deep Learning

Abstract: While first-order methods are popular for solving optimization problems that arise in largescale deep learning problems, they come with some acute deficiencies. To diminish such shortcomings, there has been recent interest in applying second-order methods such as quasi-Newton based methods which construct Hessians approximations using only gradient information. The main focus of our work is to study the behaviour of stochastic quasi-Newton algorithms for training deep neural networks. We have analyzed the perf… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 26 publications
0
1
0
Order By: Relevance
“…Therefore, Newton's method is used to solve equations with non-differentiable function with sufficient conditions of convergence and estimates of both the speed and the range of convergence [4]. Also there was interest in applying second-order methods, such as Newton-based methods that build Hessian approximations using only gradient in-formation to study the behavior of Newtonian stochastic algorithms to train deep neural networks [5].…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, Newton's method is used to solve equations with non-differentiable function with sufficient conditions of convergence and estimates of both the speed and the range of convergence [4]. Also there was interest in applying second-order methods, such as Newton-based methods that build Hessian approximations using only gradient in-formation to study the behavior of Newtonian stochastic algorithms to train deep neural networks [5].…”
Section: Introductionmentioning
confidence: 99%