2022
DOI: 10.1007/s10462-022-10153-0
|View full text |Cite
|
Sign up to set email alerts
|

EnvGAN: a GAN-based augmentation to improve environmental sound classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 46 publications
0
4
0
Order By: Relevance
“…As mentioned earlier, although existing GAN models have shown the potential of sound data augmentation [ 19 ], their effectiveness is quite limited in multi-class applications such as ASC. To overcome this, we propose DualDiscWaveGAN, a new GAN model for conditional animal sound generation.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…As mentioned earlier, although existing GAN models have shown the potential of sound data augmentation [ 19 ], their effectiveness is quite limited in multi-class applications such as ASC. To overcome this, we propose DualDiscWaveGAN, a new GAN model for conditional animal sound generation.…”
Section: Methodsmentioning
confidence: 99%
“…They showed that the accuracy of two classifiers trained with augmented data by WCCGAN improved significantly on four environmental sound datasets. Madhu and Suresh [ 19 ] developed an unconditional GAN-based augmentation model by adding two layers and one stable loss function to WaveGAN [ 20 ] to generate longer virtual waveforms suitable for representing environmental sounds. However, this approach requires a class-specific generative model to generate class-specific data properly in a multi-class environment.…”
Section: Related Workmentioning
confidence: 99%
“…There are plenty of works that address the problem of low data regimes [39]- [45]. However, their datasets and model architectures differ from those used in this work.…”
Section: B Computer Vision Vs Cybersecurity Low Data Regimesmentioning
confidence: 99%
“…In image processing, GANs are widely used for generating and restoring images of faces, scenes, and Renaissance styles [10,11]. In the field of speech, GANs are utilized for generating high-quality speech samples and speech recognition [12,13]. In natural language processing, GANs are employed to generate text content with good semantics [14,15].…”
Section: Introductionmentioning
confidence: 99%