2022
DOI: 10.5753/jidm.2022.2514
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of Automatic Speech Recognition Approaches

Abstract: Automatic Speech Recognition (ASR) is essential for many applications like automatic caption generation for videos, voice search, voice commands for smart homes, and chatbots. Due to the increasing popularity of these applications and the advances in deep learning models for transcribing speech into text, this work aims to evaluate the performance of commercial solutions for ASR that use deep learning models, such as Facebook Wit.ai, Microsoft Azure Speech, Google Cloud Speech-to-Text, Wav2Vec, and AWS Transcr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 12 publications
0
1
0
Order By: Relevance
“…Pasandi et al highlighted that conversational speech is the most challenging and environmentally relevant type of data for speech recognition. Pires et al constructed the Portuguese Evaluation Benchmark [22] using the Mozilla Common Voice and Voxforge datasets and five commercial ASR engines. Mihajlik et al conducted an evaluation of open-source Hungarian ASR systems using a comprehensive linguistic dataset [8].…”
Section: B Asr Benchmarksmentioning
confidence: 99%
“…Pasandi et al highlighted that conversational speech is the most challenging and environmentally relevant type of data for speech recognition. Pires et al constructed the Portuguese Evaluation Benchmark [22] using the Mozilla Common Voice and Voxforge datasets and five commercial ASR engines. Mihajlik et al conducted an evaluation of open-source Hungarian ASR systems using a comprehensive linguistic dataset [8].…”
Section: B Asr Benchmarksmentioning
confidence: 99%