2010
DOI: 10.7763/ijcte.2010.v2.198
|View full text |Cite
|
Sign up to set email alerts
|

Sindhi Part of Speech Tagging System Using Wordnet

Abstract: Sindhi is highly homographic language, the text is written without diacritics in real life applications, that creates lexical and morphological ambiguity. It is a most critical problem facing Sindhi computational processing and difficult to assign correct syntactic category in the text. Lot of work has been done for diacritic restorations by using statistical and linguistics approaches, still results are not on acceptable level. Tagging the non-diacritic words can be solved using semantic knowledge. This paper… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 8 publications
0
7
0
Order By: Relevance
“…The phonological systems of other Indo-Aryan languages are resembled mostly. In Sindhi language, there are 10 vowels and 43 consonants phonemes are unique [6]. When a person speaks in microphone are on telephone, the speech acquisition starts.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The phonological systems of other Indo-Aryan languages are resembled mostly. In Sindhi language, there are 10 vowels and 43 consonants phonemes are unique [6]. When a person speaks in microphone are on telephone, the speech acquisition starts.…”
Section: Methodsmentioning
confidence: 99%
“…To develop systems and techniques for speech input to machine the main aim is the speech recognition area. The spoken words or samples are needed to be collected from various regions and areas so that the different accents can be used on the basis of environment [6].…”
Section: Introductionmentioning
confidence: 99%
“…The results have been presented by applying WordNet and without WordNet and an overall accuracy has been reported as 96.28% without net and 97.14 with word net. The results have been presented with training, testing corpus and unknown words [3]. A morphological analyzer is proposed for Sindhi language by [4].…”
Section: Pos Tagging For Various Regional Languages Of Pakistan Sindh...mentioning
confidence: 99%
“…A study presented in [3] applied POS tagging for Sindhi language. The study highlighted the characteristics of Sindhi languages pertaining to POS tagging system such as the lexical and morphological ambiguity.…”
Section: Sindhi Pos Tagging Systemmentioning
confidence: 99%
“…Sindhi is a less resourced language [3,4] in comparison of English language. Nevertheless, some work has been done on tokenization and POS tagging of Sindhi text [5][6][7] as well as NLP tools are accessible online for solution of Sindhi linguistic problems [7]. In this connection, Sindhi Devanagari script [8]…”
Section: Introductionmentioning
confidence: 99%