2016 IEEE International Conference on Big Data (Big Data) 2016
DOI: 10.1109/bigdata.2016.7841061
|View full text |Cite
|
Sign up to set email alerts
|

A grapheme-level approach for constructing a Korean morphological analyzer without linguistic knowledge

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 10 publications
0
3
0
Order By: Relevance
“…As Korean is agglutinative (Song, 2006), the current literature in Korean word representations mainly focus on subword structures such as morphemes (Edmiston and Stratos, 2018), syllables (Choi et al, 2017) and Jamo (Choi et al, 2016;Stratos, 2017;Park et al, 2018). We here move forward one step further by incorporating Hanja information explicitly together with the aforementioned subword information.…”
Section: Related Workmentioning
confidence: 99%
“…As Korean is agglutinative (Song, 2006), the current literature in Korean word representations mainly focus on subword structures such as morphemes (Edmiston and Stratos, 2018), syllables (Choi et al, 2017) and Jamo (Choi et al, 2016;Stratos, 2017;Park et al, 2018). We here move forward one step further by incorporating Hanja information explicitly together with the aforementioned subword information.…”
Section: Related Workmentioning
confidence: 99%
“…Dong, et al (2016) demonstrate how radical-level features incorporated at the character-level for named entity recognition achieve state-of-the-art accuracy for Chinese. The most convincing attempt to tag Korean at the morpheme level is by Choi, et al (2016) who achieve state-of-the-art (dictionary-less) performance by using a multi-stage Bi-LSTM-CRF model that involves the splitting of Korean character input into constituent graphemes. However, the implicit assumption that Korean characters must first be split into graphemes to achieve optimal performance for morphological analysis is not well supported, and we should consider the splitting of characters into graphemes to be employing linguistic knowledge specific to Korean.…”
Section: Related Workmentioning
confidence: 99%
“…Traditional approaches to Korean morphological analysis have adopted a pipeline model of morpheme processing and POS tagging (Lee and Rim, 2009;Na, 2015;Choi et al, 2016;Matteson et al, 2018;Song and Park, 2018 Figure 1: Korean morphological analysis of a sentence "나는 하늘을 나는 새를 봤다" of which meaning is "I saw a bird flying in the sky". Correct morpheme analysis helps predicting POS tags (blue dotted arrows) while POS tagging affects morpheme analysis (red arrows).…”
Section: Introductionmentioning
confidence: 99%