2018
DOI: 10.1080/17549507.2018.1510033
|View full text |Cite
|
Sign up to set email alerts
|

Automatic speech recognition: A primer for speech-language pathology researchers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
11
0
6

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(17 citation statements)
references
References 55 publications
0
11
0
6
Order By: Relevance
“…In terms of resonance and articulation, these changes are typically embodied in vowel formant frequencies and acoustically perceived speech sounds 4 . Such speech indexes, in turn, carry abundant information about disease status 5 . Ideally, speech data are capable of indicating thorough details about the lesions, including the location, size, and degree of invasion.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In terms of resonance and articulation, these changes are typically embodied in vowel formant frequencies and acoustically perceived speech sounds 4 . Such speech indexes, in turn, carry abundant information about disease status 5 . Ideally, speech data are capable of indicating thorough details about the lesions, including the location, size, and degree of invasion.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, selecting proper acoustic features is of utmost importance. Abundant features of different physiological or psychological interpretations can be extracted based on acoustic, spectral, and cepstral measures from the speech signal 5 . Acoustic features typically include fundamental frequency (F0) and formant frequencies.…”
Section: Introductionmentioning
confidence: 99%
“…Nowadays, it is indispensable not only for private use but also for various professions [ 3 ]. In several medical specialties, ASR has been tested and implemented in speech-language pathology research, diagnostics and therapeutics such as in speech apraxia [ 4 , 5 ]. Among medical professionals, ASR is commonly used to convert speech into text for data entries and has already been sufficiently tested in a mobile environment.…”
Section: Introductionmentioning
confidence: 99%
“…Developing dysarthric automatic speech recognition (ASR) is considered to be a challenging task due to the intra-and interspeaker variability in dysarthric speech and the difficulty of obtaining suitable data, that is, matched and in sufficient quantity [3,4]. Researchers have been working on developing and improving dysarthric ASR systems and employing different techniques to overcome the challenges such as using adaptation techniques and speaker dependent models [5,6,7,3]. None of those studies have focused on recognising any of the non-verbal, precisely paralinguistic, information such as the emotional state of the speaker.…”
Section: Introductionmentioning
confidence: 99%