Vision Transformer (ViT) models have achieved strong results in computer vision tasks, and their performance has been shown to exceed that of convolutional neural networks (CNNs). However, the robustness of ViT models has received comparatively little study. To address this problem, we investigate the robustness of the ViT model under adversarial attacks and enhance it by introducing a ResNet-SE module that acts on the Attention module of the ViT model. The Attention module learns not only edge and line information but also increasingly complex feature information, while the ResNet-SE module emphasizes the important information in each feature map and suppresses the minor information, helping the model extract key features. The experimental results show that the accuracy of the proposed defense method is 19.812%, 17.083%, 18.802%, 21.490%, and 18.010% against Basic Iterative Method (BIM), C&W, DeepFool, DI2FGSM, and MDI2FGSM attacks, respectively. Compared with several other models, the proposed defense method exhibits stronger robustness.
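To make the architectural idea concrete, the following is a minimal PyTorch sketch of a squeeze-and-excitation (SE) style gate with a residual connection applied to the output of a ViT attention module. The class and parameter names (SEBlock, SEAttention, reduction) and the exact placement of the residual path are illustrative assumptions, not the paper's actual implementation.

```python
import torch
import torch.nn as nn


class SEBlock(nn.Module):
    """Channel-wise gating: squeeze token features, then re-weight each channel."""

    def __init__(self, dim: int, reduction: int = 4):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(dim, dim // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(dim // reduction, dim),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, tokens, dim); average over tokens to get a per-channel summary
        scale = self.fc(x.mean(dim=1))      # (batch, dim), values in (0, 1)
        return x * scale.unsqueeze(1)       # emphasize important channels, suppress minor ones


class SEAttention(nn.Module):
    """Multi-head self-attention followed by an SE gate, with a residual connection."""

    def __init__(self, dim: int, num_heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.se = SEBlock(dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        out, _ = self.attn(x, x, x, need_weights=False)
        # Residual add around the gated attention output; the "ResNet" wiring here
        # is an assumed placement, not taken from the paper.
        return x + self.se(out)


if __name__ == "__main__":
    tokens = torch.randn(2, 197, 768)                  # e.g. ViT-B/16 patch tokens + [CLS]
    print(SEAttention(768)(tokens).shape)              # torch.Size([2, 197, 768])
```

In this sketch the SE gate operates on the embedding channels of every token, so the attention features that matter most for classification are amplified before being passed to the rest of the block.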