2022
DOI: 10.1109/tetci.2021.3083428
|View full text |Cite
|
Sign up to set email alerts
|

AdaSwarm: Augmenting Gradient-Based Optimizers in Deep Learning With Swarm Intelligence

Abstract: This paper introduces AdaSwarm, a novel gradientfree optimizer which has similar or even better performance than the Adam optimizer adopted in neural networks. In order to support our proposed AdaSwarm, a novel Exponentially weighted Momentum Particle Swarm Optimizer (EMPSO), is proposed. The ability of AdaSwarm to tackle optimization problems is attributed to its capability to perform good gradient approximations. We show that, the gradient of any function, differentiable or not, can be approximated by using … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(3 citation statements)
references
References 37 publications
0
3
0
Order By: Relevance
“…e ways are proposed to discover operations in the early stages of aerospace design [6]. In [7], the authors described a random multimodal deep learning (RMDL) which is an ensemble system to break the problem of finding a stylish deep learning structure.…”
Section: Related Workmentioning
confidence: 99%
“…e ways are proposed to discover operations in the early stages of aerospace design [6]. In [7], the authors described a random multimodal deep learning (RMDL) which is an ensemble system to break the problem of finding a stylish deep learning structure.…”
Section: Related Workmentioning
confidence: 99%
“…The spectrum of problems in which EAs can be used is very wide. EAs have been traditionally applied to optimize neural networks (Iba, 2018), but their usage in DL networks to improve DL networks (Martinez et al, 2021), to train them (Mohapatra et al, 2021), and to create new DL networks from scratch (Elsken et al, 2019b) is more recent. The use of EA's is mainly oriented towards optimizing a complete network.…”
Section: Introductionmentioning
confidence: 99%
“…The spectrum of problems in which EAs can be used is very wide. EAs have been traditionally applied to optimize neural networks (Iba, 2018), but their usage in DL networks to improve DL networks (Martinez et al, 2021), to train them (Mohapatra et al, 2021), and to create new DL networks from scratch (Elsken et al, 2019b) is more recent. The use of EA's is mainly oriented towards optimizing a complete network.…”
Section: Introductionmentioning
confidence: 99%