ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
DOI: 10.1109/icassp43922.2022.9746273
|View full text |Cite
|
Sign up to set email alerts
|

Dual-Branch Attention-In-Attention Transformer for Single-Channel Speech Enhancement

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
25
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 59 publications
(25 citation statements)
references
References 28 publications
0
25
0
Order By: Relevance
“…Since the phase is unstructured and always challenging to estimate, this might result in an inaccurate magnitude estimation to compensate for the challenging phase. This problem can be mitigated by including both complex and magnitude losses or by complex refinement approaches, which basically decouple the problem into estimating a bounded mask for the magnitude followed by a complex refinement branch to further improve the magnitude and estimate the phase from the denoised complex representations [13], [24], [58]- [60]. However, since recent studies recommended mapping-based methods over the preceding masking-based approaches for complex spectrogram estimation [22], [61], the complex refinement branch would follow a mapping-based approach.…”
Section: A Denoisingmentioning
confidence: 99%
See 4 more Smart Citations
“…Since the phase is unstructured and always challenging to estimate, this might result in an inaccurate magnitude estimation to compensate for the challenging phase. This problem can be mitigated by including both complex and magnitude losses or by complex refinement approaches, which basically decouple the problem into estimating a bounded mask for the magnitude followed by a complex refinement branch to further improve the magnitude and estimate the phase from the denoised complex representations [13], [24], [58]- [60]. However, since recent studies recommended mapping-based methods over the preceding masking-based approaches for complex spectrogram estimation [22], [61], the complex refinement branch would follow a mapping-based approach.…”
Section: A Denoisingmentioning
confidence: 99%
“…For the time-domain methods, we included the standard SEGAN [14] and three recent methods: TSTNN [16], DEMUCS [17] and SE-Conformer [18]. For the TF-domain methods, we evaluate six recent SOTA methods, i.e., MetricGAN [11], PHASEN [12], PFPL [104], Metric-GAN+ [105], DB-AIAT [13] and DPT-FSNet [26]. It can be observed that most of the TF-domain methods outperform the time-domain counterparts over all utilized metrics.…”
Section: A Denoisingmentioning
confidence: 99%
See 3 more Smart Citations