Neural Architecture Search for Transformers: A Survey

Chitty-Venkata, Krishna Teja; Emani, Murali; Vishwanath, Venkatram; Somani, Arun K.

doi:10.1109/access.2022.3212767

Cited by 39 publications

(15 citation statements)

References 217 publications

“…In contrast, NAS has not yet been fully explored for ViTs. In [160], the authors surveyed several NAS techniques for ViTs. To the best of our knowledge, there are limited studies on the NAS exploration in ViTs [161][162][163][164][165][166], and more attention is needed in the future.…”

Section: Neural Architecture Search (NAS); type: mentioning

confidence: 99%

A Comprehensive Survey of Transformers for Computer Vision

Jamil, Piran, Kwon. 2023. Drones.

As a special type of transformer, vision transformers (ViTs) can be used for various computer vision (CV) applications. Convolutional neural networks (CNNs) have several potential problems that can be resolved with ViTs. For image coding tasks such as compression, super-resolution, segmentation, and denoising, different variants of ViTs are used. In our survey, we determined the many CV applications to which ViTs are applicable. CV applications reviewed included image classification, object detection, image segmentation, image compression, image super-resolution, image denoising, anomaly detection, and drone imagery. We reviewed the state of the art, compiled a list of available models, and discussed the pros and cons of each model.

“…The cross entropy between $p(x)$ and $q(x; \theta)$ is given by $H(p, q) \equiv -\mathbb{E}_{p}[\log q(x; \theta)]$, (16) where the second equality holds if the number of samples ($m$) is large enough. Equation (16) shows that maximizing the likelihood in (15) with respect to the parameter $\theta$ is equivalent to minimizing the cross entropy of (13).…”

Section: Appendix; type: mentioning

confidence: 99%
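
The relationship described in the quoted statement above can be checked numerically. The following is a minimal sketch, not code from the cited paper: the three-symbol distributions `p` and `q`, the sample count, and the helper names are all illustrative assumptions. It compares the exact cross entropy with the sample average of the negative log-likelihood, and shows that the maximum-likelihood estimate (the empirical frequencies) gives the smallest average negative log-likelihood on those samples.

```python
import math
import random
from collections import Counter

def cross_entropy(p, q):
    """Exact H(p, q) = -E_p[log q(x)] for discrete distributions stored as dicts."""
    return -sum(p[x] * math.log(q[x]) for x in p)

def avg_neg_log_likelihood(samples, q):
    """Sample average of -log q(x); approximates H(p, q) when samples come from p."""
    return -sum(math.log(q[x]) for x in samples) / len(samples)

random.seed(0)
p = {"a": 0.5, "b": 0.3, "c": 0.2}   # assumed data distribution (hypothetical)
q = {"a": 0.4, "b": 0.4, "c": 0.2}   # assumed model q(x; theta) (hypothetical)

m = 100_000                           # number of samples
samples = random.choices(list(p), weights=list(p.values()), k=m)

exact = cross_entropy(p, q)
estimate = avg_neg_log_likelihood(samples, q)

# Maximum-likelihood estimate for a categorical model: empirical frequencies.
counts = Counter(samples)
q_mle = {x: counts[x] / m for x in p}
nll_mle = avg_neg_log_likelihood(samples, q_mle)

print(f"exact H(p, q)          = {exact:.4f}")
print(f"sample-average NLL (q) = {estimate:.4f}")
print(f"sample-average NLL at the MLE = {nll_mle:.4f}")
```

For large `m` the two cross-entropy values agree closely, and no other categorical model achieves a lower average negative log-likelihood on the same samples than `q_mle`.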

“…Thus, they can be regarded as a special kind of regularization [12] to improve overfitting due to small datasets. At present, Transformer has become the mainstream architecture of PTMs for NLP tasks [13]. The well-known pre-trained language models BERT, GPT-2 and GPT-3 [14]-[15] are extensions of the Transformer architecture.…”

Section: Introduction; type: mentioning

confidence: 99%

Performance Improvement on Traditional Chinese Task-Oriented Dialogue Systems With Reinforcement Learning and Regularized Dropout Technique

Sheu, Wu. 2023. IEEE Access.

The development of conversational voice assistant applications has been in full swing around the world. This paper aims to develop traditional Chinese multi-domain task-oriented dialogue (TOD) systems. Such systems are typically implemented using a pipeline approach, in which submodules are optimized independently, resulting in inconsistencies among them. Instead, this paper implements end-to-end multi-domain TOD models using pre-trained deep neural networks (DNNs). This allows us to integrate all the submodules into one single DNN model to resolve these inconsistencies. Data shortages are common in conversational natural language processing (NLP) tasks using DNN models. In this regard, dropout regularization has been widely used to mitigate overfitting caused by insufficient training data. However, the randomness it introduces leads to non-negligible discrepancies between training and inference. On the other hand, pre-trained language models have successfully provided effective regularization for NLP tasks. An inherent disadvantage is that fine-tuning the pre-trained language model suffers from exposure bias and loss-evaluation mismatch. To this end, we propose a reinforcement learning (RL) approach to address both issues. Furthermore, we adopt a method called regularized dropout (R-Drop) to reduce the inconsistency introduced by dropout layers in DNNs. Experimental results show that both our proposed RL approach and the R-Drop technique can significantly improve the joint target accuracy (JGA) score and combined score of the traditional Chinese TOD system in the tasks of dialogue state tracking (DST) and end-to-end sentence prediction, respectively.

Index Terms: NLP, regularized dropout, reinforcement learning, task-oriented dialogue
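
To make the R-Drop idea mentioned in this abstract more concrete, here is a minimal sketch of the objective as it is commonly formulated: the same input is passed through the network twice with different dropout masks, and a symmetric KL term penalizes disagreement between the two predicted distributions. This is an illustration under stated assumptions, not the cited paper's implementation; the function names, the weight `alpha`, and the example logits are hypothetical.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    top = max(logits)
    exps = [math.exp(z - top) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def r_drop_loss(logits_a, logits_b, target, alpha=1.0):
    """Sketch of an R-Drop style loss for a single example.

    logits_a and logits_b stand for two forward passes of the same input
    under different dropout masks; alpha (assumed) weights the consistency term.
    """
    p_a, p_b = softmax(logits_a), softmax(logits_b)
    # Negative log-likelihood of the target under both passes.
    nll = -math.log(p_a[target]) - math.log(p_b[target])
    # Symmetric KL divergence between the two predictive distributions.
    consistency = 0.5 * (kl_divergence(p_a, p_b) + kl_divergence(p_b, p_a))
    return nll + alpha * consistency

# Hypothetical logits from two dropout passes over the same input.
pass_a = [2.0, 0.5, -1.0]
pass_b = [1.5, 0.9, -0.8]
print(r_drop_loss(pass_a, pass_b, target=0))
```

When the two passes produce identical logits, the consistency term vanishes and the loss reduces to twice the ordinary negative log-likelihood.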

“…Utilizing this self‐built prototype system, we successfully obtained hyperspectral images of four distinct bacterial pathogens and analyzed their spectral differences. Additionally, inspired by the great achievements of the Transformer network in the fields of natural language processing (NLP) and image processing, such as ChatGPT and ViT [23], this article is dedicated to extending the network architecture to identify hyperspectral images of infectious pathogens.…”

Section: Introduction; type: mentioning

confidence: 99%

Hyperspectral upgrade solution for biomicroscope combined with Transformer network to classify infectious bacteria

Lu, Zhang, Wang, et al. 2024. Journal of Biophotonics.

Infectious diseases caused by bacterial pathogens pose a significant public health threat, emphasizing the need for swift and accurate bacterial species detection methods. Hyperspectral microscopic imaging (HMI) offers nondestructive, rapid, and data‐rich advantages, making it a promising tool for microbial detection. In this research, we present a highly compatible and cost‐effective approach to extend a standard biomicroscope system into a hyperspectral biomicroscope using a prism‐grating‐prism configuration. Using this prototype, we generate 600 hyperspectral data cubes for Listeria, Bacillus typhi, Bacillus pestis, and Bacillus anthracis. Additionally, we propose a Transformer‐based classification network that achieves a 99.44% accuracy in classifying these infectious pathogens, outperforming traditional methods. Our results suggest that the combination of HMI and the optimized Transformer‐based classification network has potential for rapid and precise detection of infectious disease pathogens.
