2022
DOI: 10.1109/taslp.2022.3192104
|View full text |Cite
|
Sign up to set email alerts
|

Neural Cascade Architecture for Multi-Channel Acoustic Echo Suppression

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(6 citation statements)
references
References 47 publications
0
6
0
Order By: Relevance
“…For example, it has been suggested that ancient rock art is often created at places that can generate maximal echo intensities [ 42 ], and there are a number of places that are famous for generating echoes, such as the echo wall at the Temple of Heaven in Beijing and the whispering gallery of St Paul’s Cathedral in London. During modern teleconferencing, echoes are frequently encountered: An echo is generated when the voice from one side of the conversation is picked up by the microphone on another side and transmitted back [ 32 ]. In real teleconference recordings, echoes almost always have >100 ms latency [ 43 ].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, it has been suggested that ancient rock art is often created at places that can generate maximal echo intensities [ 42 ], and there are a number of places that are famous for generating echoes, such as the echo wall at the Temple of Heaven in Beijing and the whispering gallery of St Paul’s Cathedral in London. During modern teleconferencing, echoes are frequently encountered: An echo is generated when the voice from one side of the conversation is picked up by the microphone on another side and transmitted back [ 32 ]. In real teleconference recordings, echoes almost always have >100 ms latency [ 43 ].…”
Section: Discussionmentioning
confidence: 99%
“…In daily reverberant environments, however, the power of sound reflection decays exponentially as the delay of the reflection increases [ 31 ]. Echoic environments are prevalent during online conferencing [ 32 ]. Previous studies have shown that the influence of reverberation on speech envelope can be compensated through basic neural adaptation mechanisms [ 9 , 33 ].…”
Section: Introductionmentioning
confidence: 99%
“…With recent advances in deep neural networks, deep learningbased methods [5,6,7] have been utilized for AEC, and their ability to model nonlinear relations leads to promising results, even in challenging noisy or double-talk scenarios. Such methods usually treat AEC as a source separation problem and directly estimate the nearend signal based on the microphone and far-end reference signal.…”
Section: Introductionmentioning
confidence: 99%
“…Besides, Peng et al [14] described a threestage AEC and suppression framework for the ICASSP 2021 AEC Challenge, where the partitioned block frequency domain least mean square (PBFDLMS) with a time alignment was firstly implemented to cancel the linear echo components, and two deep learning networks were then proposed to suppress the residual echo and the non-speech residual noise simultaneously. In addition, Zhang et al [15] proposed a neural cascade architecture, including a CRN module and an LSTM module, which is used for joint acoustic echo and noise suppression to address both single-channel and multi-channel AEC problems. More recently, Cheng et al [16] proposed a deep complex multi-frame filtering network for stereophonic AEC, where two deep learning-based modules were separately used for suppression of the linear and residual echo components.…”
Section: Introductionmentioning
confidence: 99%