Proceedings of the Eighth Workshop on Speech and Language Processing for Assistive Technologies 2019
DOI: 10.18653/v1/w19-1706
|View full text |Cite
|
Sign up to set email alerts
|

Investigating Speech Recognition for Improving Predictive AAC

Abstract: Making good letter or word predictions can help accelerate the communication of users of high-tech AAC devices. This is particularly important for real-time person-to-person conversations. We investigate whether performing speech recognition on the speakingside of a conversation can improve language model based predictions. We compare the accuracy of three plausible microphone deployment options and the accuracy of two commercial speech recognition engines (Google and IBM Watson). We found that despite recogni… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…For text entry in AAC, Wisenburn and Higginbotham (2008) demonstrated that providing noun phrases from a conversation partner's speech as selection options increases text-entry speed by 36.7%. Adhikary et al (2019) concluded that with currently-attainable accuracy of ASR, partner speech can be valuable in improving language modeling for AAC text entry. Shen et al (2022) used a fine-tuned GPT-2 model (Radford et al, 2019) to expand bags of keywords into full phrases in conversational contexts based on the ConvAI2 dataset and reported a KSR of 77% at a word error rate threshold of 0.65.…”
Section: Introductionmentioning
confidence: 99%
“…For text entry in AAC, Wisenburn and Higginbotham (2008) demonstrated that providing noun phrases from a conversation partner's speech as selection options increases text-entry speed by 36.7%. Adhikary et al (2019) concluded that with currently-attainable accuracy of ASR, partner speech can be valuable in improving language modeling for AAC text entry. Shen et al (2022) used a fine-tuned GPT-2 model (Radford et al, 2019) to expand bags of keywords into full phrases in conversational contexts based on the ConvAI2 dataset and reported a KSR of 77% at a word error rate threshold of 0.65.…”
Section: Introductionmentioning
confidence: 99%