2018 26th European Signal Processing Conference (EUSIPCO) 2018
DOI: 10.23919/eusipco.2018.8553243
|View full text |Cite
|
Sign up to set email alerts
|

Trust-Region Minimization Algorithm for Training Responses (TRMinATR): The Rise of Machine Learning Techniques

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 16 publications
0
5
0
Order By: Relevance
“…is already computed when we computed the gradient in (26). The line search multi-batch L-BFGS optimization algorithm for deep Q-Leaning is provided in Algorithm 4.…”
Section: L-bfgs Line-search Deep Q-learning Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…is already computed when we computed the gradient in (26). The line search multi-batch L-BFGS optimization algorithm for deep Q-Leaning is provided in Algorithm 4.…”
Section: L-bfgs Line-search Deep Q-learning Methodsmentioning
confidence: 99%
“…These methods not only have the benefit of being independent from the fine-tuning of hyperaparameters, but they may improve upon the training performance and the convergence robustness of the line-search methods. Furthermore, trust-region L-BFGS methods can easily reject the search directions if the curvature condition is not satisfied in order to preserve the positive definiteness of the L-BFGS matrices [26]. The computational bottleneck of trust-region methods is the solution of the trust-region subproblem.…”
Section: Motivationmentioning
confidence: 99%
See 2 more Smart Citations
“…While line search and trust-region methods have already been applied to ML, as demonstrated by Rafati et al (2018), they have not yet been used in combination with adaptive batch size. Also, Lederrey et al (2018a,b) have demonstrated that the use of a full batch data is required at the end of the optimization process to achieve the appropriate precision for DCMs.…”
Section: Adaptive Batch Sizementioning
confidence: 99%