2022
DOI: 10.1609/aaai.v36i10.21419
|View full text |Cite
|
Sign up to set email alerts
|

Reference-Based Speech Enhancement via Feature Alignment and Fusion Network

Abstract: Speech enhancement aims at recovering a clean speech from a noisy input, which can be classified into single speech enhancement and personalized speech enhancement. Personalized speech enhancement usually utilizes the speaker identity extracted from the noisy speech itself (or a clean reference speech) as a global embedding to guide the enhancement process. Different from them, we observe that the speeches of the same speaker are correlated in terms of frame-level short-time Fourier Transform (STFT) spectrogra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(1 citation statement)
references
References 38 publications
0
1
0
Order By: Relevance
“…There is a method [29] to improve speech quality by extracting Mel frequency spectral coefficients (MFCCs) from noisy signals and improving them using deep features from high-quality reference MFCCs spectrograms.…”
Section: Reference-based Approachmentioning
confidence: 99%
“…There is a method [29] to improve speech quality by extracting Mel frequency spectral coefficients (MFCCs) from noisy signals and improving them using deep features from high-quality reference MFCCs spectrograms.…”
Section: Reference-based Approachmentioning
confidence: 99%