2020
DOI: 10.5715/jnlp.27.801

Language Resources for Japanese Lexical Simplification

Abstract: This study introduces three language resources for Japanese lexical simplification: 1) an evaluation dataset, 2) lexica, and 3) a toolkit that can be used to develop and benchmark Japanese lexical simplification systems. The word complexity lexicon adopted in this study was automatically expanded using a classifier trained on a small word complexity lexicon created by Japanese language teachers. Based on this word complexity estimator, simplified word pairs were extracted from a large-scale synonym lexicon, an…

Cited by 2 publications (6 citation statements)
References 30 publications
“…We presented various use cases and applications of lexical complexity prediction, including for other NLP-related tasks such as sentiment analysis (Section 5.3.1), author identification (Section 5.3.2), and machine translation (Section 5.3.3). We collected and summarized English datasets used for LCP (Section 6.1) and also briefly presented work on languages other than English (Section 6.2): Chinese [76], Japanese [93], and Swedish [124].…”
Section: Discussion (mentioning)
confidence: 99%
“…JEV contains 18,000 thousand Japanese words divided into three levels of difficulty: easy, medium, or difficult. Nishihara and Kajiwara [93] also rated the complexity of words from Japanese Wikipedia, the Tsukuba Web Corpus [94], and the Corpus of Contemporary Written Japanese [83]. This increased the size of their dataset to 40,605 Japanese words.…”
Section: 21 (mentioning)
confidence: 99%