2019
DOI: 10.1121/1.5093547
|View full text |Cite
|
Sign up to set email alerts
|

A deep learning algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker and reverberation

Abstract: For deep learning based speech segregation to have translational significance as a noise-reduction tool, it must perform in a wide variety of acoustic environments. In the current study, performance was examined when target speech was subjected to interference from a single talker and room reverberation. Conditions were compared in which an algorithm was trained to remove both reverberation and interfering speech, or only interfering speech. A recurrent neural network incorporating bidirectional long short-ter… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

1
23
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 34 publications
(24 citation statements)
references
References 31 publications
1
23
0
Order By: Relevance
“…In recent years, deep learning has been widely applied in the fields of computer vision and speech recognition, and has achieved great success ( Varadarajan et al, 2020 ). The deep neural network technology can also be used in the recommendation system, such as the music recommendation function based on the convolutional neural network, which can extract the tonal features of music; the personalized recommendation technology based on the reverse artificial neural network can recode the feature into the low-dimensional vector and calculate the prediction score using the implicit feature; the recommendation system based on the restricted Boltzmann machine can use the visual layer to automatically decode and encode the new scoring data of scored items, and predict the scores of unscored items ( Healy et al, 2019 ; Raissi et al, 2019 ; Tuli et al, 2020 ). The application of deep neural network greatly improves the computational accuracy of the current recommendation system compared with the past.…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, deep learning has been widely applied in the fields of computer vision and speech recognition, and has achieved great success ( Varadarajan et al, 2020 ). The deep neural network technology can also be used in the recommendation system, such as the music recommendation function based on the convolutional neural network, which can extract the tonal features of music; the personalized recommendation technology based on the reverse artificial neural network can recode the feature into the low-dimensional vector and calculate the prediction score using the implicit feature; the recommendation system based on the restricted Boltzmann machine can use the visual layer to automatically decode and encode the new scoring data of scored items, and predict the scores of unscored items ( Healy et al, 2019 ; Raissi et al, 2019 ; Tuli et al, 2020 ). The application of deep neural network greatly improves the computational accuracy of the current recommendation system compared with the past.…”
Section: Introductionmentioning
confidence: 99%
“…These have been shown to improve the intelligibility of speech in stationary noise for CI users (Loizou et al, 2005;Dawson et al, 2011;Mauger et al, 2012) and NH listeners using CI simulations (Bolner et al, 2016;Lai et al, 2018). Data-based algorithms using machine-learning (ML) techniques, such as deep neural networks (DNNs) or Gaussian mixture models (GMMs), were successful for speech in non-stationary, multi-talker babble and achieved significant SI improvements for NH (Kim et al, 2009;Bentsen et al, 2018), hearing-impaired (HI; Healy et al, 2013;Healy et al, 2015;Healy et al, 2019;Chen et al, 2016;Monaghan et al, 2017;Bramsløw et al, 2018), and CI listeners (Hu and Loizou, 2010;Goehring et al, 2017;Lai et al, 2018). Improvements of more recent approaches over earlier ones have been mainly driven by two factors: the use of more powerful DNN-based regression systems instead of classification systems, and the use of a ratio mask instead of a binary mask as the training target (Madhu et al, 2013;Bentsen et al, 2018).…”
Section: Introductionmentioning
confidence: 99%
“…Improvements of more recent approaches over earlier ones have been mainly driven by two factors: the use of more powerful DNN-based regression systems instead of classification systems, and the use of a ratio mask instead of a binary mask as the training target (Madhu et al, 2013;Bentsen et al, 2018). However, all of these algorithms made use of some a priori information about the target speech and/or interfering noise by using the same target speaker (Lai et al, 2018;Chen et al, 2016), background noise (Goehring et al, 2017), or both (Kim et al, 2009;Hu and Loizou, 2010;Healy et al, 2013;Healy et al, 2015;Healy et al, 2019;Goehring et al, 2017;Lai et al, 2018;Bramsløw et al, 2018;Bentsen et al, 2018) for the training and testing of the algorithm.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations