1999
DOI: 10.1162/089976699300016223

Improving the Convergence of the Backpropagation Algorithm Using Learning Rate Adaptation Methods

Abstract: This article focuses on gradient-based backpropagation algorithms that use either a common adaptive learning rate for all weights or an individual adaptive learning rate for each weight and apply the Goldstein/Armijo line search. The learning-rate adaptation is based on descent techniques and estimates of the local Lipschitz constant that are obtained without additional error function and gradient evaluations. The proposed algorithms improve the backpropagation training in terms of both convergence rate and co…
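
The following Python sketch illustrates the idea summarized in the abstract: a learning rate derived from a local Lipschitz constant that is estimated from quantities the previous iteration already produced (the last weight vector and gradient), safeguarded by an Armijo-style sufficient-decrease test. It is a minimal sketch of the general technique under stated assumptions, not the paper's exact algorithm; the function names, the factor 1/(2L), and the backtracking parameters are illustrative choices.

    import numpy as np

    def lipschitz_learning_rate(w, w_prev, g, g_prev, eta_default=0.01):
        # Local Lipschitz estimate L ~ ||g - g_prev|| / ||w - w_prev||,
        # computed without extra error-function or gradient evaluations.
        dw_norm = np.linalg.norm(w - w_prev)
        dg_norm = np.linalg.norm(g - g_prev)
        if dw_norm == 0.0 or dg_norm == 0.0:
            return eta_default
        return 1.0 / (2.0 * (dg_norm / dw_norm))   # larger curvature -> smaller step

    def armijo_safeguarded_step(w, w_prev, g, g_prev, loss_fn, sigma=1e-4, beta=0.5):
        # Start from the Lipschitz-based learning rate, then backtrack
        # (Goldstein/Armijo style) until the error decreases sufficiently.
        eta = lipschitz_learning_rate(w, w_prev, g, g_prev)
        E = loss_fn(w)
        gg = float(np.dot(g, g))
        for _ in range(30):                        # cap the number of backtracks
            if loss_fn(w - eta * g) <= E - sigma * eta * gg:
                break
            eta *= beta
        return w - eta * g

Here w, w_prev, g, g_prev stand for the flattened weight and gradient vectors of the network at the current and previous epoch, and loss_fn is the batch error function.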

Cited by 120 publications (77 citation statements)
References 23 publications
“…For this study, we used the implementations in the Xfuzzy environment; see [39] for a more detailed description of the wide range of methods supported. Among them, we distinguish four classes of methods: gradient descent [32], conjugate gradient, second-order or quasi-Newton [3], and algorithms with no derivatives. Table 5 shows the test errors for the best option from each of the first three classes of algorithms: Resilient Propagation (Rprop) [42,32], from the gradient descent class; Scaled Conjugate Gradient (SCG) [35], from the conjugate gradient class; and Levenberg-Marquardt (L-M) [3], from the second-order class of methods.…”
Section: B. Comparison of Different Neuro-Fuzzy Methods (mentioning)
confidence: 99%
“…It is based on the idea of function comparison methods (Scales, 1985), taking into account E(t − 1) < E(t), and exploits the signs of the gradient values. The parameter q is a reduction factor that is used to update the midpoint of the considered interval; the choice of q has an influence on the number of error function evaluations required to obtain an acceptable weight vector (Magoulas et al., 1999)…”
Section: Implementation of the Jrprop (mentioning)
confidence: 99%
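
The excerpt above describes a function-comparison rule: a candidate step is accepted only when the error decreases, otherwise the step is shrunk by the reduction factor q, and every rejected candidate costs one extra error-function evaluation. The Python sketch below shows that control flow under simplifying assumptions (a single global step size and a sign-based update); it is not the actual Jrprop procedure, and the default q and iteration cap are illustrative.

    import numpy as np

    def comparison_based_update(w, g, loss_fn, eta, q=0.5, max_reductions=10):
        # Accept a candidate step only if the error decreases, i.e. avoid
        # ending up with E(t-1) < E(t); otherwise shrink the step by q.
        E_prev = loss_fn(w)
        for _ in range(max_reductions):
            w_new = w - eta * np.sign(g)      # use only the signs of the gradient
            if loss_fn(w_new) < E_prev:       # error decreased: accept the step
                return w_new, eta
            eta *= q                          # reduction factor q shrinks the interval
        return w, eta                         # no acceptable step found: keep w

A smaller q shrinks the step aggressively and tends to reach an acceptable weight vector in fewer comparisons, at the price of shorter steps; a q closer to 1 keeps longer steps but may need more error-function evaluations.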
“…Adaptive gradient-based algorithms with individual step-sizes try to overcome the inherent difficulty of choosing the right learning rates for each region of the search space, which depends on the application (Magoulas et al., 1997, 1999). This is done by controlling the update of each weight in order to minimize oscillations and maximize the length of the step-size.…”
Section: Introduction (mentioning)
confidence: 99%
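
The excerpt above describes per-weight step-size control: a weight's individual step grows while its partial derivative keeps the same sign (to maximize the step length) and shrinks when the sign flips (to damp oscillations). The sketch below is a generic Rprop-style illustration of that rule; the increase/decrease factors and the step bounds are conventional defaults, not values taken from the cited papers.

    import numpy as np

    def per_weight_step_update(w, g, g_prev, delta,
                               inc=1.2, dec=0.5, dmin=1e-6, dmax=50.0):
        # Grow each weight's individual step while its gradient keeps its
        # sign, shrink it on a sign change, and move every weight against
        # the sign of its current partial derivative.
        same_sign = g * g_prev > 0
        flipped = g * g_prev < 0
        delta = np.where(same_sign, np.minimum(delta * inc, dmax), delta)
        delta = np.where(flipped, np.maximum(delta * dec, dmin), delta)
        step = -np.sign(g) * delta
        step = np.where(flipped, 0.0, step)    # suppress the update right after a sign flip
        return w + step, delta

All arrays are element-wise NumPy vectors over the weights; delta holds the per-weight step sizes carried from one epoch to the next.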
“…Much research has been carried out on increasing the convergence speed of the EBP algorithm in MLP neural networks; see [2,3,8,7,13,12,14]. In [2,3], Abid and Fnaiech summarize the approaches for increasing the convergence speed of EBP into seven categories: the weight-updating procedure, the choice of optimization criterion, the use of adaptive parameters, the estimation of optimal initial conditions, pre-processing of the problem before using the MLP, optimization of the MLP structure, and the use of more advanced algorithms.…”
Section: Introduction (mentioning)
confidence: 99%
“…In this paper we concentrate on a dynamic learning rate for updating the network weights, similar to what was implemented in [7,8,12]. We therefore implemented a Variable Step Size (VSS) method that accelerates convergence of the algorithm by reducing the number of learning epochs.…”
Section: Introduction (mentioning)
confidence: 99%