2019
DOI: 10.1007/s10462-019-09775-8
|View full text |Cite
|
Sign up to set email alerts
|

ASRoIL: a comprehensive survey for automatic speech recognition of Indian languages

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
2
1

Relationship

2
8

Authors

Journals

citations
Cited by 40 publications
(12 citation statements)
references
References 123 publications
0
12
0
Order By: Relevance
“…In addition to accessibility, speech signals contain many other emotional cues [4]. Although speech signals contain substantial amounts of information, it can be unrewarding to drop the linguistic component that coexists with it, especially given that the text component can be easily transcribed in real world applications with the considerable successes in the domain of speech-to-text with several commercial-scale APIs being available [5].…”
Section: Introductionmentioning
confidence: 99%
“…In addition to accessibility, speech signals contain many other emotional cues [4]. Although speech signals contain substantial amounts of information, it can be unrewarding to drop the linguistic component that coexists with it, especially given that the text component can be easily transcribed in real world applications with the considerable successes in the domain of speech-to-text with several commercial-scale APIs being available [5].…”
Section: Introductionmentioning
confidence: 99%
“…This paper tried to explore state-of-art speech recognition with respect to different feature and modeling approaches in Indian and non-Indian languages across the world. Singh et al [306] conducted a review of the spoken languages of India. The survey was conducted based on the relevant research articles published from 2000 to 2018.…”
Section: Inclusion/exclusion Criteriamentioning
confidence: 99%
“…Several works on ASR system were carried out in the past decade using machine learning and deep learning models [1]. Deep speech and Listen Attend Spell is a end to end ASR models using neural network techniques [2][3].…”
Section: Figure 1: Audio Speech Recognitionmentioning
confidence: 99%