2009
DOI: 10.1007/978-3-642-00831-3_2
|View full text |Cite
|
Sign up to set email alerts
|

CRF Models for Tamil Part of Speech Tagging and Chunking

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 21 publications
(3 citation statements)
references
References 6 publications
0
3
0
Order By: Relevance
“…ROLE was a frequent scene element in comparison with STATIC/ANIMATED-OBJECT or other scene elements in those stories. The CRF model can handle this imbalanced data and learn the elements with a small number of samples, as in NER (Finkel, Grenager, and Manning 2005) and POS tagging problems (Pandian and Geetha 2009). The increase in the average accuracy of the mapping task in the third attempt (sequential modeling), that is, 85.7% compared with the second attempt (non-sequential modeling), that is, 76.58%, confirmed the ability of CRF to model sequential and imbalanced data.…”
Section: Conditional Random Fieldsmentioning
confidence: 99%
“…ROLE was a frequent scene element in comparison with STATIC/ANIMATED-OBJECT or other scene elements in those stories. The CRF model can handle this imbalanced data and learn the elements with a small number of samples, as in NER (Finkel, Grenager, and Manning 2005) and POS tagging problems (Pandian and Geetha 2009). The increase in the average accuracy of the mapping task in the third attempt (sequential modeling), that is, 85.7% compared with the second attempt (non-sequential modeling), that is, 76.58%, confirmed the ability of CRF to model sequential and imbalanced data.…”
Section: Conditional Random Fieldsmentioning
confidence: 99%
“…However, this is difficult to use on real data due to the complexity of natural languages. Some works are based on linear statistic models, such as Conditional Random Fields (CRF) [13] and Hidden Markov [14]. These statistic models perform relatively well on the corpora tagged with a coarse-grained tagset, but they do not perform as well as the Bi-LSTM on the corpora tagged with a fine-grained tagset [15].…”
Section: Pos Taggingmentioning
confidence: 99%
“…Recently among Asian languages, several supervised learning techniques with acceptable performance have been proposed. For PoS tagging, Pandian and Geetha [1] utilized conditional random fields (CRFs), a probabilistic model, to segment and label sequence data, to tag and chunk PoS in Tamil. Huang et al [2] showed that a bigram PoS tagger using latent annotations could achieve the accuracy of 94.78% when testing on a set of the Penn Chinese Treebank 6.0.…”
Section: Introductionmentioning
confidence: 99%