2020
DOI: 10.1007/978-981-15-4828-4_18
|View full text |Cite
|
Sign up to set email alerts
|

Development of a Novel Database in Gujarati Language for Spoken Digits Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…Additionally, our proposed CNN showed faster convergence, as in Figure 7. For evaluating the effectiveness of our proposed CNN on different languages, we performed the experiments on Gujarati [29] and English [41,42] spoken digits. Our proposed CNN also outperformed the Gujarati Digit Model by absolute 22% accuracy.…”
Section: Classification Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Additionally, our proposed CNN showed faster convergence, as in Figure 7. For evaluating the effectiveness of our proposed CNN on different languages, we performed the experiments on Gujarati [29] and English [41,42] spoken digits. Our proposed CNN also outperformed the Gujarati Digit Model by absolute 22% accuracy.…”
Section: Classification Resultsmentioning
confidence: 99%
“…The results are promising, with a 69 percent of overall recognition accuracy. Dalsaniya et al [29] presented a novel, publicly available audio dataset of spoken digits in the Gujarati language, as well as some preliminary results. A comparison of the categorization algorithm with an English language database was also performed.…”
Section: Related Workmentioning
confidence: 99%
“…These corpora have been made available to serve a wide range of purposes. (Dalsaniya et al, 2020) introduces an innovative audio dataset comprising isolated Gujarati digits. This database encompasses recordings of digits spoken by 20 individuals from five distinct regions within Gujarat, captured in real-world environments.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The Free Spoken Digit Gujarati Dataset prompts the conclusion that, following cross-corpus experiments conducted by the research team, there exists substantial potential for enhancing the dataset's capabilities. This enhancement involves exploring diverse features and techniques to augment the system's ability to generalize effectively across various contexts (Dalsaniya et al, 2020).…”
Section: Performance Evaluationmentioning
confidence: 99%
“…We used 3-layers CNN, EfficientNet [38] and mobileNet [9]. For a fair comparison, we took the same model architectures that were defined in [1,5]. For the CNN model, we implemented a 3-layer architecture as described in [1] and shown in Table 1.…”
Section: Algorithmsmentioning
confidence: 99%