2015
DOI: 10.1007/978-3-319-23980-4_3
|View full text |Cite
|
Sign up to set email alerts
|

Morphological Disambiguation of Classical Sanskrit

Abstract: The paper describes a new tagset for the morphological disambiguation of Sanskrit, and compares the accuracy of two machine learning methods (CRF, deep recurrent neural networks) for this task, with a special focus on how to model the lexicographic information. It reports a significant improvement over previously published results.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 19 publications
0
3
0
Order By: Relevance
“…Most NLP systems for SWS combine Pān . inis phonetic and morphological rules with a lexical resource, either by using formal (Huet, 2005;Goyal et al, 2009;Kulkarni and Shukla, 2009) or statistical methods, including Dirichlet processes (Natarajan and Charniak, 2011), finite state methods (Mittal, 2010), graph queries (Krishna et al, 2016) and hybrid systems (Hellwig, 2015a).…”
Section: Related Researchmentioning
confidence: 99%
“…Most NLP systems for SWS combine Pān . inis phonetic and morphological rules with a lexical resource, either by using formal (Huet, 2005;Goyal et al, 2009;Kulkarni and Shukla, 2009) or statistical methods, including Dirichlet processes (Natarajan and Charniak, 2011), finite state methods (Mittal, 2010), graph queries (Krishna et al, 2016) and hybrid systems (Hellwig, 2015a).…”
Section: Related Researchmentioning
confidence: 99%
“…Recently, Tkachenko and Sirts (2018) proposed to generate this sequence of morphological features using a neural encoder-decoder architecture. Hellwig (2016) shows a significant improvement in performance for morphological tagging in Sanskrit by using a monolithic tagset with recurrent neural network based tagging model. In systems using monolithic labels, multiple feature values pertaining to a word are combined to form a single label (Müller et al, 2013;, which leads to data sparsity for morphologically rich languages such as Czech, Turkish and Sanskrit.…”
Section: Introductionmentioning
confidence: 98%
“…The language relies heavily on morphological markers to determine the syntactic, and to some extent the semantic roles, of words in a sentence. There exist limited and partly incompatible solutions (Hellwig, 2016;Goyal and Huet, 2016;Krishna et al, 2018) for morphological tagging of Sanskrit that heavily rely on lexicon driven shallow parsers and other linguistic knowledge. However recently, neural sequential labelling models have achieved competitive results in morphological tagging for multiple languages (Cotterell and Heigold, 2017;Tkachenko and Sirts, 2018;Malaviya et al, 2018).…”
Section: Introductionmentioning
confidence: 99%