2021
DOI: 10.48550/arxiv.2102.05245
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On PercepNet

Abstract: Speech enhancement algorithms based on deep learning have greatly surpassed their traditional counterparts and are now being considered for the task of removing acoustic echo from hands-free communication systems. This is a challenging problem due to both real-world constraints like loudspeaker non-linearities, and to limited compute capabilities in some communication systems. In this work, we propose a system combining a traditional acoustic echo canceller, and a low-complexity joint residual echo and noise s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…In [2] a similar idea was followed, however, only for a very small subset of the possible combinations, a rather simple feedforward neural network, and without explicit consideration of noise reduction capabilities. Furthermore, an intuitive and often seen input signal combination of a RES/NR network is the pair of enhanced signal E (k) and reference signal X (k), which delivers state of the art performance, e.g., in [4,9]. But as the reference signal does not contain any information about the room characteristics yet, could the estimated echo signal D (k) be a better choice?…”
Section: Experimental Designmentioning
confidence: 99%
See 1 more Smart Citation
“…In [2] a similar idea was followed, however, only for a very small subset of the possible combinations, a rather simple feedforward neural network, and without explicit consideration of noise reduction capabilities. Furthermore, an intuitive and often seen input signal combination of a RES/NR network is the pair of enhanced signal E (k) and reference signal X (k), which delivers state of the art performance, e.g., in [4,9]. But as the reference signal does not contain any information about the room characteristics yet, could the estimated echo signal D (k) be a better choice?…”
Section: Experimental Designmentioning
confidence: 99%
“…In the meantime, also fully learned deep AEC approaches were proposed, where a single network incorporates the tasks of AEC, RES, and NR, e.g., [6,7] or further investigated in [8]. Although showing an impressive suppression performance, however, these are often accompanied by some near-end speech degradation and-at least for now-hybrid approaches are still one step ahead as can be seen with the leading model of the AEC Challenge on ICASSP 2021 being a hybrid approach [9].…”
Section: Introductionmentioning
confidence: 99%
“…Hybrid approaches, however, have proven very successful. For a variety of AEC tasks [37]- [45], for example, neural networks are used for residual echo suppression, noise suppression, or related. In a similar vein, neural networks paired with AFs for active noise control tasks have proven successful [46]- [49].…”
Section: Introductionmentioning
confidence: 99%
“…Customized AEC adaptive filters take many forms including algorithms based on sparsity [6], adaptive normalization [7], and adaptive learning-rates [8], as well as data-driven approaches for selecting learning rates automatically [9,10] and based on a meta-stepsize [11,12]. More recently, deep learning techniques have been used as AEC sub-components including learned residual echo suppressors [13,14,15], double-talk detectors [16], and nonlinear distortions blocks [17,18,19,20,21]. These approaches, however, commonly do not use neural network modules that adapt at test In the machine learning literature, there have been exciting developments in meta-learning, automatic machine learning, and learning how to learn methods.…”
Section: Introductionmentioning
confidence: 99%