2019
DOI: 10.48550/arxiv.1904.07845
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 17 publications
0
3
0
Order By: Relevance
“…a is the projection of b onto a. SI-SNR has also been used in many end-to-end separation models [6,10,30,2].…”
Section: Permutation Invariant Trainingmentioning
confidence: 99%
“…a is the projection of b onto a. SI-SNR has also been used in many end-to-end separation models [6,10,30,2].…”
Section: Permutation Invariant Trainingmentioning
confidence: 99%
“…1. For example, we apply the recently proposed scale-invariant signal-to-noise ratio (SI-SNR) [8,18] with PIT [6,7]:…”
Section: Proposed Approachmentioning
confidence: 99%
“…The time-frequency domain method is to establish the frequency domain signal characteristics through short-time Fourier transform, use the T-F [5] method to separate the signal, and finally reconstruct the source waveform through inverse. The time-frequency domain methods mainly include [5,6] . The endto-end time-domain method mainly uses the waveform mixed in the original stage.…”
Section: Introductionmentioning
confidence: 99%