2017
DOI: 10.1109/taslp.2017.2738445
|View full text |Cite
|
Sign up to set email alerts
|

Perceptual Information Loss due to Impaired Speech Production

Abstract: Abstract-Phonological classes define articulatory-free and articulatory-bound phone attributes. Deep neural network is used to estimate the probability of phonological classes from the speech signal. In theory, a unique combination of phone attributes form a phoneme identity. Probabilistic inference of phonological classes thus enables estimation of their compositional phoneme probabilities. A novel information theoretic framework is devised to quantify the information conveyed by each phone attribute, and ass… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(8 citation statements)
references
References 37 publications
0
8
0
Order By: Relevance
“…In Some Cases, Studies Incorporate More Than One Technique. Following That, the Same Study Was Replicated, Thereby Increasing the Overall Number of Research Studies Method/Technique % Reference MFCCs/derived features from MFCCs 35.4% [ 13 , 15 , 55 , 70 , 72 , 80 , 87 , 94 , 95 , 97–99 , 101 , 112 , 117 , 127 , 131 , 133 , 134 ] Spectro-Temporal of utterances/keywords 26.15% [ 12 , 14 , 15 , 26 , 63 , 72 , 87 , 89 , 90 , 97 , 101–103 , 110 , 112 , 124 , 136 ] Articulation way/Speech timing 18.5% [ 13 , 27 , 58 , 81 , 86–88 , 111 , 113 , 124 , 129 ] Glottal flow 7.7% [ 58 , …”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…In Some Cases, Studies Incorporate More Than One Technique. Following That, the Same Study Was Replicated, Thereby Increasing the Overall Number of Research Studies Method/Technique % Reference MFCCs/derived features from MFCCs 35.4% [ 13 , 15 , 55 , 70 , 72 , 80 , 87 , 94 , 95 , 97–99 , 101 , 112 , 117 , 127 , 131 , 133 , 134 ] Spectro-Temporal of utterances/keywords 26.15% [ 12 , 14 , 15 , 26 , 63 , 72 , 87 , 89 , 90 , 97 , 101–103 , 110 , 112 , 124 , 136 ] Articulation way/Speech timing 18.5% [ 13 , 27 , 58 , 81 , 86–88 , 111 , 113 , 124 , 129 ] Glottal flow 7.7% [ 58 , …”
Section: Discussionmentioning
confidence: 99%
“…One significant gap lies in the limited availability of large, balanced, and high-quality annotated datasets for training and evaluating machine learning models for speech disorders. 7 , 11 , 12 , 102 , 105 , 117 , 120 , 123 , 138 The scarcity of such datasets limits the generalizability and validity of the results, leading to increased ambiguity and reduced statistical control. Moreover, it hinders the ability to draw strong inferences and develop and evaluate robust algorithms.…”
Section: Gaps and Future Directionsmentioning
confidence: 99%
See 1 more Smart Citation
“…A RTIFICIAL neural networks are applied to a lot of fields, such as machine translation [1], speech recognition [2], object detection [3], robotics [4]- [6], intelligent control [7], etc. In neural networks, the gradient-descent-based algorithm is one of the most widely used optimization algorithms [8].…”
Section: Introductionmentioning
confidence: 99%
“…Their study focuses on the low-frequency region that is lower than 4000 Hz in the speech spectrum. However, the changes in the place of articulation and movement of articulators are closely related to high-frequency components of speech signals [27][28][29]. The distribution information in the high-frequency part of the speech signal is not considered in their research.…”
Section: Introductionmentioning
confidence: 99%