2024
DOI: 10.1109/access.2024.3361286
|View full text |Cite
|
Sign up to set email alerts
|

Spatio-Temporal Features Representation Using Recurrent Capsules for Monaural Speech Enhancement

Jawad Ali,
Nasir Saleem,
Sami Bourouis
et al.

Abstract: Single-channel speech enhancement is important for modern communication systems and has received a lot of attention. A convolutional neural network (CNN) successfully learns feature representations from speech spectrograms but loses spatial information due to distortion, which is important for humans to understand speech. Speech feature learning is an important ongoing research to capture higher-level representations of speech that go beyond conventional techniques. By considering the hierarchical structure an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 86 publications
(80 reference statements)
0
2
0
Order By: Relevance
“…The x-axis represents time of sequences of spectra, and color brightness on the other axis represents the frequency of the strength of each component at each time frame. Spectrograms show where there is high or low energy, and how energy levels vary over time (Ali et al, 2024 ). In insect song synthesis, spectrograms capture the temporal and spectral characteristics of the insect sounds.…”
Section: Methodsmentioning
confidence: 99%
“…The x-axis represents time of sequences of spectra, and color brightness on the other axis represents the frequency of the strength of each component at each time frame. Spectrograms show where there is high or low energy, and how energy levels vary over time (Ali et al, 2024 ). In insect song synthesis, spectrograms capture the temporal and spectral characteristics of the insect sounds.…”
Section: Methodsmentioning
confidence: 99%
“…When combined with recurrent features of Recurrent Neural Networks (RNN), this hybridization can be useful in environments that require memory over time. Studies such as (Wu et al, 2021;Sezavar et al, 2024;Ali et al, 2024) have demonstrated the optimal performance of Cap-sNets with recurrent structures.…”
Section: Hybrid Models and Their Potential In Advancing Deep Learningmentioning
confidence: 99%