2021
DOI: 10.1177/23312165211041475

Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort

Abstract: A deep recurrent neural network (RNN) for reducing transient sounds was developed and its effects on subjective speech intelligibility and listening comfort were investigated. The RNN was trained using sentences spoken with different accents and corrupted by transient sounds, using the clean speech as the target. It was tested using sentences spoken by unseen talkers and corrupted by unseen transient sounds. A paired-comparison procedure was used to compare all possible combinations of three conditions for sub…

Cited by 5 publications (6 citation statements)
References 31 publications

“…[49][50][51][52][53] ANNs are typically used to process sound, images, and videos. [54][55][56][57][58][59] Recurrent neural networks (RNNs) are a type of NNs where connections between nodes can generate a cycle, that lets the output from some nodes impact the following input to the same nodes. 43 In other words, RNNs have a memory and offer a temporal dynamic behavior.…”
Section: Introduction (citation type: mentioning)
confidence: 99%
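
The recurrence described in the statement above, where a node's output feeds back into its own next input, can be illustrated with a minimal sketch. It assumes a single-unit recurrent cell with a tanh activation; the function names and weight values are hypothetical and chosen only for illustration. This is not the network from the cited paper.

```python
import math


def rnn_step(x, h_prev, w_in=0.5, w_rec=0.9, bias=0.0):
    """One step of a single-unit recurrent cell (hypothetical weights).

    The previous state h_prev is fed back in alongside the new input x,
    which is the "cycle" that gives the cell a memory.
    """
    return math.tanh(w_in * x + w_rec * h_prev + bias)


def run_rnn(inputs, h0=0.0):
    """Feed a sequence of any length through the cell.

    Returns the hidden state after each time step.
    """
    states = []
    h = h0
    for x in inputs:
        h = rnn_step(x, h)
        states.append(h)
    return states


# A single impulse followed by silence: the state stays non-zero after
# the input has ended, because each step still sees the previous state.
states = run_rnn([1.0, 0.0, 0.0, 0.0])
```

With the recurrent weight set to zero (`w_rec=0.0`), the same cell would return exactly zero for every silent step, which is the sense in which the feedback connection supplies the temporal behavior the statement describes.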
“…Indeed, RNNs are FFNNs that can inherently process variable-length input sequences. 54 A subfield of ML is deep learning (DL) (Figure 1) which is based on ANNs where multiple layers are employed for data processing to obtain progressively higher-level features from data. 60 A DL model can perform complex functions due to having more layers and nodes within a layer.…”
Section: Introduction (citation type: mentioning)
confidence: 99%
“…However, other studies show that an increase in discomfort is noticed only within a relative comparison between the aided and unaided condition, and not within a comparison with normal-hearing listeners [13,14]. There are several studies evaluating the benefit of INR, however, the methods and outcomes differ among these studies [15][16][17][18][19]. Most studies report a positive effect of INR when test subjects were asked to rate annoyance, sound quality or loudness comfort.…”
Section: Introduction (citation type: mentioning)
confidence: 99%