2012 20th Telecommunications Forum (TELFOR) 2012
DOI: 10.1109/telfor.2012.6419311
|View full text |Cite
|
Sign up to set email alerts
|

Application of neural networks in whispered speech recognition

Abstract: Sadržaj -U radu su prezentovani eksperimentalni rezultati istraživanja prepoznavanja šapata, kao specifi nog oblika verbalne komunikacije, primenom vešta kih neuronskih mreža (ANN). Prikazana je govorna baza re i izgovorenih normalnim govorom i šapatom, posebno formirana za ovo istraživanje, iji deo je upotrebljen za obuku i testiranje ANN. Testiran je slu aj prepoznavanja zavisno od govornika a rezultati su pokazali 100% prepoznavanja u slu aju govora i 99,3% u slu aju šapata. U slu aju prepoznavanja šapata, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

1
3
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 6 publications
1
3
0
Order By: Relevance
“…This degradation is not as pronounced if an AM trained from whispered speech were to be used to recognize neutral speech. This result has been consistent for experiments done in several languages for which there is a sizeable corpus of parallel whispered speech, such as Serbian [3], Japanese [4], Mandarin [5] and English [6,7]. The pattern is also consistent for different types of models, both Gaussian Mixture Models (GMM) trained with generative or discriminative methods, and even for Deep Neural Network (DNN) based AMs.…”
Section: Related Worksupporting
confidence: 81%
“…This degradation is not as pronounced if an AM trained from whispered speech were to be used to recognize neutral speech. This result has been consistent for experiments done in several languages for which there is a sizeable corpus of parallel whispered speech, such as Serbian [3], Japanese [4], Mandarin [5] and English [6,7]. The pattern is also consistent for different types of models, both Gaussian Mixture Models (GMM) trained with generative or discriminative methods, and even for Deep Neural Network (DNN) based AMs.…”
Section: Related Worksupporting
confidence: 81%
“…However, the effectiveness of E2E approaches over whispered speech is yet to be confirmed. Previous works suggested that deep learning was useful for whispered speech recognition [21,22,23], while the success of E2E approaches on normal speech ASR was widely believed to depend on the quantity of data [24] and the model architecture [25]. It is much more difficult to collect whispered speech data of reasonable size, and the special characteristics of whispered speech may need special considerations in model design and training.…”
Section: Introductionmentioning
confidence: 99%
“…Whispering is a natural mode of speech communication which in recent years has garnered more attention in the speech community [1,2,3,4,5,6]. During whisper, vocal folds do not vibrate and consequently pitch is absent [7].…”
Section: Introductionmentioning
confidence: 99%