2017 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC) 2017
DOI: 10.1109/ropec.2017.8261588
|View full text |Cite
|
Sign up to set email alerts
|

Speech recognition using deep neural networks trained with non-uniform frame-level cost functions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 25 publications
0
3
0
Order By: Relevance
“…They adopted two techniques, namely DNN-based regression to enhance reverberant and noisy speech, followed by DNN-based multicondition training for recognition. Becerra et al [37] introduced two new variations of the frame-level cost function for training a DNN in order to achieve better speech recognition. Due to the profound differences between acoustic characteristics of neutral and whispered speech, the performance of traditional automatic Speech Recognition (ASR) systems trained on neutral speech degrades significantly when the whisper is applied.…”
Section: Voice and Video Processingmentioning
confidence: 99%
See 1 more Smart Citation
“…They adopted two techniques, namely DNN-based regression to enhance reverberant and noisy speech, followed by DNN-based multicondition training for recognition. Becerra et al [37] introduced two new variations of the frame-level cost function for training a DNN in order to achieve better speech recognition. Due to the profound differences between acoustic characteristics of neutral and whispered speech, the performance of traditional automatic Speech Recognition (ASR) systems trained on neutral speech degrades significantly when the whisper is applied.…”
Section: Voice and Video Processingmentioning
confidence: 99%
“…The use of Deep Learning (DL) methods, which are the multi-layered structure of ANNs along with the improvements of GPU technology, have accelerated these advances. Furthermore, DL approaches have significantly outperformed state-of-the-art approaches in many fields such as object recognition [1,3,7,9,25,26], image processing [11,[27][28][29][30][31][32], computer vision [33][34][35][36], speech recognition [37][38][39], natural language processing (NLP) [10,21,27,[40][41][42], character recognition [5,30,[43][44][45][46], signature verification [2,6,[47][48][49][50][51]. Although the foundations of DL were based on ANN proposed by McCulloch and Pitts in 1943 [52], the real popularity has increased in 2012.…”
Section: Introduction (Gi̇ri̇ş)mentioning
confidence: 99%
“…The use of Deep Learning (DL) methods, which are the multi-layered structure of ANNs along with the improvements of GPU technology, have accelerated these advances. Furthermore, DL approaches have significantly outperformed state-of-the-art approaches in many fields such as object recognition [1,3,7,9,25,26], image processing [11,[27][28][29][30][31][32], computer vision [33][34][35][36], speech recognition [37][38][39], natural language processing (NLP) [10,21,27,[40][41][42], character recognition [5,30,[43][44][45][46], signature verification [2,6,[47][48][49][50][51]. Although the foundations of DL were based on ANN proposed by McCulloch and Pitts in 1943 [52], the real popularity has increased in 2012.…”
Section: Introduction (Gi̇ri̇ş)mentioning
confidence: 99%