Proceedings of SLPAT 2015: 6th Workshop on Speech and Language Processing for Assistive Technologies 2015
DOI: 10.18653/v1/w15-5113
|View full text |Cite
|
Sign up to set email alerts
|

Pronunciation Adaptation For Disordered Speech Recognition Using State-Specific Vectors of Phone-Cluster Adaptive Training

Abstract: Pronunciation variation is a major problem in disordered speech recognition. This paper focus on handling the pronunciation variations in dysarthric speech by forming speaker-specific lexicons. A novel approach is proposed for identifying mispronunciations made by each dysarthric speaker, using state-specific vector (SSV) of phone-cluster adaptive training (Phone-CAT) acoustic model. SSV is low-dimensional vector estimated for each tied-state where each element in a vector denotes the weight of a particular mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 21 publications
0
4
0
Order By: Relevance
“…Sriranjani et al [87] proposed a new approach to identify the pronunciation errors of each speaker with dysarthria, using the state-specific vector (SSV) of the acoustic model trained using phoneme cluster adaptation (Phone-CAT). SSV was a low-dimensional vector estimated for each binding state, with each element in the vector representing the weight of a specific mono-phoneme.…”
Section: Language-lexical Model Of Asr For Dysarthric Speechmentioning
confidence: 99%
“…Sriranjani et al [87] proposed a new approach to identify the pronunciation errors of each speaker with dysarthria, using the state-specific vector (SSV) of the acoustic model trained using phoneme cluster adaptation (Phone-CAT). SSV was a low-dimensional vector estimated for each binding state, with each element in the vector representing the weight of a specific mono-phoneme.…”
Section: Language-lexical Model Of Asr For Dysarthric Speechmentioning
confidence: 99%
“…Their approach reduced world error rate by 5.93% and 13.68% compared to baseline and error correction approaches with context-independent confusion matrices, respectively. Sriranjani, et al [120] proposed to use the state specific vector (SSV) of the acoustic model trained by phoneme cluster adaptation (Phone-CAT) to identify the pronunciation errors of each speaker with dysarthria. SSV is a low-dimensional vector estimated for each binding state, with each element representing the weight of a specific mono-phoneme.…”
Section: Language and Lexical Model Of Asr For Dysarthric Speechmentioning
confidence: 99%
“…There are many examples of assistive technology that rely on speech and natural language processing. For instance, sign language translation (Camgoz et al, 2018), pronunciation adaptation for disordered speech (Sriranjani et al, 2015) and synthesised voices for individuals with vocal disabilities (Veaux et al, 2013). Text simplification is an area of natural language processing concerned with the simplification of textual information and is often recognised as having assistive applications.…”
Section: Introductionmentioning
confidence: 99%