2022
DOI: 10.1007/978-3-031-11035-1_2
|View full text |Cite
|
Sign up to set email alerts
|

Improving Automatic Speech Recognition for Non-native English with Transfer Learning and Language Model Decoding

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 41 publications
0
1
0
Order By: Relevance
“…Additionally, the absence of clear auditory punctuation cues necessitates that LLMs infer sentence boundaries and pauses contextually, a task fraught with potential for error [4]. Previous research has highlighted the complexities involved in speech recognition tasks, noting that even state-of-the-art models struggle with certain phonetic complexities and punctuation inference [5], [6]. The findings from this study will contribute to the existing body of knowledge by providing a comparative analysis of two leading multimodal models, offering insights into their strengths and limitations.…”
Section: Introductionmentioning
confidence: 99%
“…Additionally, the absence of clear auditory punctuation cues necessitates that LLMs infer sentence boundaries and pauses contextually, a task fraught with potential for error [4]. Previous research has highlighted the complexities involved in speech recognition tasks, noting that even state-of-the-art models struggle with certain phonetic complexities and punctuation inference [5], [6]. The findings from this study will contribute to the existing body of knowledge by providing a comparative analysis of two leading multimodal models, offering insights into their strengths and limitations.…”
Section: Introductionmentioning
confidence: 99%