Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-1807
|View full text |Cite
|
Sign up to set email alerts
|

Who Said That? a Comparative Study of Non-negative Matrix Factorization Techniques

Abstract: In noisy environments it is difficult for a computer to understand what a person is saying, especially when there are multiple speakers. In this paper we concentrate on separating overlapping speech. Non-negative matrix factorisation (NMF) is a method of doing source separation without needing a lot of data. The choice of cost function can have a significant impact on the performance of NMF. We evaluate NMF using three different cost functions (Euclidean, Itakura-Saito and Kullback-Leibler), including modifica… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 17 publications
0
3
0
Order By: Relevance
“…The above result may be explained by the fact that all http://journals.uob.edu.bh [25], (b) NMF-EUC [4], (c) NMF-EUC, current investigation on an overlapped speech signal comprising same language speeches by English female and male speakers and (d) NMF-EUC, current investigation on an overlapped speech signal comprising different language by English male and Marathi female speakers. The y-axis scale for all the figures is same Indo-Aryan languages like Marathi and Bengali have more aspirated consonants than English, which are produced with an audible expulsion of breath, whereas the unaspirated are pronounced with minimal breath.…”
Section: B Signal Level Metricsmentioning
confidence: 94%
See 2 more Smart Citations
“…The above result may be explained by the fact that all http://journals.uob.edu.bh [25], (b) NMF-EUC [4], (c) NMF-EUC, current investigation on an overlapped speech signal comprising same language speeches by English female and male speakers and (d) NMF-EUC, current investigation on an overlapped speech signal comprising different language by English male and Marathi female speakers. The y-axis scale for all the figures is same Indo-Aryan languages like Marathi and Bengali have more aspirated consonants than English, which are produced with an audible expulsion of breath, whereas the unaspirated are pronounced with minimal breath.…”
Section: B Signal Level Metricsmentioning
confidence: 94%
“…They also point out that there is immense scope for improving audio source separation in overlapping speech scenarios. DNN, though it shows promising results in separation performance, is characterized by high computational complexity and suffers degraded performance on problems with limited training data or small data sets [25]. NMF, on the other hand, is still prevalent for separation with limited training datasets.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation