2022
DOI: 10.1007/978-3-030-94188-8_24
|View full text |Cite
|
Sign up to set email alerts
|

Toward an End-to-End Voice to Sign Recognition for Dialect Moroccan Language

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 14 publications
0
5
0
Order By: Relevance
“…The authors also explored works related to Moroccan Darija resources, such as lexicons, corpora, and dialect identification, emphasizing the importance of annotated corpora for further research in this area.  In 2021, the Dvoice dataset 1.0, developed by SI2M labs in collaboration with AIOX Lab [11] [12] proposed a solution based on the state-of-the-art architecture named deep speech 2 by Baidu. They conducted tests on 24 hours of speech data (DarSpeech) and achieved promising results, with a word error rate (WER) of 22.7% and a character error rate of 6.03%.…”
Section: Related Workmentioning
confidence: 99%
“…The authors also explored works related to Moroccan Darija resources, such as lexicons, corpora, and dialect identification, emphasizing the importance of annotated corpora for further research in this area.  In 2021, the Dvoice dataset 1.0, developed by SI2M labs in collaboration with AIOX Lab [11] [12] proposed a solution based on the state-of-the-art architecture named deep speech 2 by Baidu. They conducted tests on 24 hours of speech data (DarSpeech) and achieved promising results, with a word error rate (WER) of 22.7% and a character error rate of 6.03%.…”
Section: Related Workmentioning
confidence: 99%
“…The dataset was highly unbalanced, and the original data for languages such as Darija, Yoruba, and Sudanese were extremely low. Thereafter, we conduct experiments based on augmented data using the augmentation method proposed in [27], which adds a perturbation, augmentation lobe and also corrupts speech features by adding a frequency and a time dropout.…”
Section: Language Choices and Datasetsmentioning
confidence: 99%
“…We use the ALFFA Public [24][25][26] dataset for the Amharic, Fongbe, Swahili and Wolof languages (see Table 2). For Darija, the DVoice dataset was used [27], a dataset created using pseudo-labelling and data augmentation techniques. The Sudanese dataset was collected from Youtube videos then labeled manually.…”
Section: Language Choices and Datasetsmentioning
confidence: 99%
See 1 more Smart Citation
“…In the paper [1] Authors presents building an automated voice to sign system based on pose estimation but generating more own animations for not available poses can improvise the system. In paper [2] authors discussed on text to sign conversion for Arabic sign language and it can be extended to other languages with more vocabulary.…”
Section: Related Workmentioning
confidence: 99%