2023
DOI: 10.1016/j.dib.2023.109014

A vast dataset for Kurdish handwritten digits and isolated characters recognition

citations
Cited by 4 publications
(2 citation statements)
references
References 7 publications
“…The Kurdish language is classified as less resourced in terms of natural language processing (NLP). The similar datasets for other languages previously were conducted, but the sources for the Kurdish languages is inferior and a small number of the dataset available related to the language [2, 3]. The language needs essential tools such as name recognition, lemmatization, POS tagger, etc.…”
Section: Objective (mentioning)
confidence: 99%
“…Thus, the language requires its unique tool for performing such tasks. Word tokenization is another process acquired from using KLPT (Kurdish Language Processing Toolkit) [2]. This tool tokenizes Kurdish texts according to the morphological features of the language.…”
Section: Dataset Preprocessing (mentioning)
confidence: 99%