2021
DOI: 10.1587/transinf.2020edp7111
Neural Architecture Search for Convolutional Neural Networks with Attention

Abstract: The recent development of neural architecture search (NAS) has enabled us to automatically discover high-performance neural network architectures within a few days. Convolutional neural networks extract fruitful features by repeatedly applying standard operations (convolutions and pooling). However, these operations also extract useless or even disturbing features. Attention mechanisms enable neural networks to discard information of no interest and have achieved state-of-the-art performance. Whil…
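To make the abstract's point concrete: a typical attention module learns a gate that rescales feature channels, suppressing the useless or disturbing ones. Below is a minimal sketch of a squeeze-and-excitation-style channel attention block in PyTorch; it is an illustrative example only, and the class name, layer sizes, and reduction ratio are assumptions, not the paper's actual module.

```python
# Minimal sketch of channel attention (squeeze-and-excitation style).
# Illustrative only: layer sizes and the reduction ratio are assumptions.
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)   # squeeze: global average pool
        self.fc = nn.Sequential(              # excitation: per-channel gate in [0, 1]
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        w = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * w                           # rescale: low-weight channels are suppressed

# Usage: features from a conv block are reweighted channel-wise.
feats = torch.randn(8, 64, 32, 32)
out = ChannelAttention(64)(feats)
assert out.shape == feats.shape
```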

Cited by 6 publications (5 citation statements) | References 25 publications

Citation statements (ordered by relevance):
“…The downside of this algorithm is its running time; it takes more than 50 GPU days to optimize the model. Moreover, the literature offers methods besides the mentioned algorithm, such as binarized neural networks [147], swarm intelligence [148], greedy optimizers [47,149], novelty search strategies [150], attention-based search [151], slow-fast learning [152], and enhanced RL combined with a new reward function [60].…”
Section: Adaptive Training (mentioning)
confidence: 99%
“…Hao et al. [47] proposed introducing an attention mechanism into the network architecture to support information interaction between candidate architectures, enabling the search process to focus on selecting better network architectures. Weng et al. [48] and Nakai et al. [49] both added attention mechanism modules as operations in the search space. The former added the attention module directly to the search space, whereas the latter introduced a new attention search space containing multiple attention operations and concatenated the operations searched in it with those searched in the original search space. Both approaches improved the performance of the network architecture.…”
Section: Related Work (mentioning)
confidence: 99%
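As a concrete illustration of the search-space extension described in the statement above: attention modules become ordinary candidate operations, and a separately searched attention operation is concatenated after the operation chosen from the original search space. The following is a minimal, hypothetical sketch; the candidate names, the se_gate helper, and build_edge are assumptions for illustration, not the cited authors' code.

```python
# Sketch of a two-part search space: an original operation set plus a separate
# attention set whose chosen op is applied after the chosen original op.
# All names here are hypothetical; this shows the idea, not the exact method.
import torch
import torch.nn as nn

def se_gate(c: int, r: int = 4) -> nn.Module:
    """A small squeeze-and-excitation style gate, one attention candidate."""
    class SE(nn.Module):
        def __init__(self):
            super().__init__()
            self.fc = nn.Sequential(
                nn.AdaptiveAvgPool2d(1), nn.Conv2d(c, c // r, 1),
                nn.ReLU(inplace=True), nn.Conv2d(c // r, c, 1), nn.Sigmoid())
        def forward(self, x):
            return x * self.fc(x)
    return SE()

ORIGINAL_OPS = {
    "sep_conv_3x3": lambda c: nn.Conv2d(c, c, 3, padding=1, groups=c, bias=False),
    "max_pool_3x3": lambda c: nn.MaxPool2d(3, stride=1, padding=1),
    "skip_connect": lambda c: nn.Identity(),
}
ATTENTION_OPS = {
    "se_attention": se_gate,
    "no_attention": lambda c: nn.Identity(),  # the search may also drop attention
}

def build_edge(op_name: str, att_name: str, channels: int) -> nn.Module:
    # One searched edge = original op followed by a searched attention op.
    return nn.Sequential(ORIGINAL_OPS[op_name](channels),
                         ATTENTION_OPS[att_name](channels))

edge = build_edge("sep_conv_3x3", "se_attention", channels=32)
y = edge(torch.randn(2, 32, 16, 16))  # shape preserved: (2, 32, 16, 16)
```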
“…Other gradient-based neural architecture search methods take explicit and implicit approaches, respectively, to the problem of poor stability in architecture search. P-DARTS [45], ASM-NAS [47], Att-DARTS [49], DARTS+ [51], DARTS- [52], MileNAS [54], R-DARTS [59], …”
Section: Relationship With Previous Work (mentioning)
confidence: 99%
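For context on the gradient-based methods listed above: they generally build on the DARTS continuous relaxation, in which each edge computes a softmax-weighted mixture of candidate operations and the mixture weights are trained by gradient descent. A minimal sketch follows, assuming a toy candidate set; MixedOp and its parameters are illustrative, not any specific method's implementation.

```python
# Sketch of the DARTS-style continuous relaxation underlying the methods above:
# an edge mixes all candidate ops, weighted by a softmax over learned
# architecture parameters. Candidate set and sizes are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixedOp(nn.Module):
    def __init__(self, channels: int):
        super().__init__()
        self.ops = nn.ModuleList([
            nn.Conv2d(channels, channels, 3, padding=1, bias=False),
            nn.AvgPool2d(3, stride=1, padding=1),
            nn.Identity(),
        ])
        # one architecture parameter per candidate operation
        self.alpha = nn.Parameter(torch.zeros(len(self.ops)))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        w = F.softmax(self.alpha, dim=0)  # continuous relaxation of the choice
        return sum(wi * op(x) for wi, op in zip(w, self.ops))

mixed = MixedOp(16)
out = mixed(torch.randn(1, 16, 8, 8))
# After search, the candidate with the largest alpha is kept (discretization).
```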
“…In addition, the attention mechanism can help the neural network select useful features and discard the less useful ones. Attention mechanism modules have been introduced to enrich the search space [24, 25] and improve architecture search performance. Other methods have also used different search strategies [26–30] to try to alleviate the above problems.…”
Section: Introduction (mentioning)
confidence: 99%