2021
DOI: 10.48550/arxiv.2110.10330
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement

Abstract: With the recent surge of video conferencing tools usage, providing high-quality speech signals and accurate captions have become essential to conduct day-to-day business or connect with friends and families. Single-channel personalized speech enhancement (PSE) methods show promising results compared with the unconditional speech enhancement (SE) methods in these scenarios due to their ability to remove interfering speech in addition to the environmental noise. In this work, we leverage spatial information affo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 19 publications
0
1
0
Order By: Relevance
“…The array-geometry-agnostic modeling is useful for production. In parallel to this work, we examined its impact on personalized noise reduction [33]. Further investigation in different tasks is desired.…”
Section: Discussionmentioning
confidence: 99%
“…The array-geometry-agnostic modeling is useful for production. In parallel to this work, we examined its impact on personalized noise reduction [33]. Further investigation in different tasks is desired.…”
Section: Discussionmentioning
confidence: 99%