Proceedings of the 2018 Conference of the North American Chapter Of the Association for Computational Linguistics: Hu 2018
DOI: 10.18653/v1/n18-1019
|View full text |Cite
|
Sign up to set email alerts
|

Simplification Using Paraphrases and Context-Based Lexical Substitution

Abstract: Lexical simplification involves identifying complex words or phrases that need to be simplified, and recommending simpler meaningpreserving substitutes that can be more easily understood. We propose a complex word identification (CWI) model that exploits both lexical and contextual features, and a simplification mechanism which relies on a wordembedding lexical substitution model to replace the detected complex words with simpler paraphrases. We compare our CWI and lexical simplification models to several base… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 18 publications
(12 citation statements)
references
References 16 publications
0
12
0
Order By: Relevance
“…Fader et al (2013) and Dong et al (2017) employ paraphrase knowledge to enhance question answering models. Kriz et al (2018) utilize paraphrase and context-based lexical substitution knowledge to improve simplification task. Similarly, Zhao et al (2018a) combine paraphrase rules of PPDB (Ganitkevitch et al, 2013) with Transformer (Vaswani et al, 2017) to perform sentence simplification task.…”
Section: Related Workmentioning
confidence: 99%
“…Fader et al (2013) and Dong et al (2017) employ paraphrase knowledge to enhance question answering models. Kriz et al (2018) utilize paraphrase and context-based lexical substitution knowledge to improve simplification task. Similarly, Zhao et al (2018a) combine paraphrase rules of PPDB (Ganitkevitch et al, 2013) with Transformer (Vaswani et al, 2017) to perform sentence simplification task.…”
Section: Related Workmentioning
confidence: 99%
“…Extending the complex word identification model of Kriz et al (2018), we train a linear regression model using length, number of syllables, and word frequency; we also include Word2Vec embeddings (Mikolov et al, 2013). To collect data for this task, we consider the Newsela corpus, a collection of 1,840 news articles written by professional editors at 5 reading levels .…”
Section: Word Complexity Predictionmentioning
confidence: 99%
“…Lexical simplification: Prior work on lexical simplification depends on lexical and corpusbased features to assess word complexity. For complex word identification, there are broadly two lines of research: learning a frequency-based threshold over a large corpus (Shardlow, 2013b) or training an ensemble of classifiers over a combination of lexical and language model features (Shardlow, 2013a;Paetzold and Specia, 2016a;Yimam et al, 2017;Kriz et al, 2018). Substitution ranking also follows similar trend.…”
Section: Related Workmentioning
confidence: 99%