“…They do not call for function evaluations but require tuning the learning rate and possibly further hyper-parameters such as the mini-batch size. Since this tuning effort may be computationally very demanding [15], more sophisticated approaches use line-search or trust-region strategies to choose the learning rate adaptively and thus avoid such tuning, see [2,4,5,9,14,15,25]. In this context, the function and gradient approximations have to satisfy sufficient accuracy requirements with some probability.…”
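As an illustration of the last sentence only (schematic notation, not taken from the cited works [2,4,5,9,14,15,25]): with $f_k(\cdot)$ and $g_k$ denoting mini-batch estimates of $f$ and $\nabla f(x_k)$, a step size $t_k$ might be accepted through an Armijo-type test evaluated on the estimates,
\[
  f_k(x_k - t_k g_k) \;\le\; f_k(x_k) - c\, t_k\, \|g_k\|^2, \qquad c \in (0,1),
\]
while the estimates themselves are required to be sufficiently accurate with a prescribed probability, e.g.
\[
  \Pr\!\big( \|g_k - \nabla f(x_k)\| \le \kappa\, t_k\, \|g_k\| \big) \;\ge\; p, \qquad \kappa > 0,\; p \in (0,1].
\]
If the test fails, $t_k$ is reduced as in a deterministic backtracking line search; the difference is that sufficient decrease and accuracy now hold only with probability at least $p$, which is what the probabilistic accuracy requirements referred to above formalize.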