Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2018
DOI: 10.18653/v1/d18-1289
|View full text |Cite
|
Sign up to set email alerts
|

Is this Sentence Difficult? Do you Agree?

Abstract: In this paper, we present a crowdsourcingbased approach to model the human perception of sentence complexity. We collect a large corpus of sentences rated with judgments of complexity for two typologically-different languages, Italian and English. We test our approach in two experimental scenarios aimed to investigate the contribution of a wide set of lexical, morpho-syntactic and syntactic phenomena in predicting i) the degree of agreement among annotators independently from the assigned judgment and ii) the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
29
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 32 publications
(35 citation statements)
references
References 14 publications
1
29
0
1
Order By: Relevance
“…We first collected an appropriate corpus to evaluate the effect of context on the perception of sentence complexity. We started from the crowdsourced dataset by Brunato et al (2018), which contains 1, 200 sentences annotated for perceived complexity on a 7-point Likert scale. We similarly built a crowdsourcing task, asking native English speakers to read each sentence and rate its complexity on the same scale.…”
Section: Approachmentioning
confidence: 99%
See 2 more Smart Citations
“…We first collected an appropriate corpus to evaluate the effect of context on the perception of sentence complexity. We started from the crowdsourced dataset by Brunato et al (2018), which contains 1, 200 sentences annotated for perceived complexity on a 7-point Likert scale. We similarly built a crowdsourcing task, asking native English speakers to read each sentence and rate its complexity on the same scale.…”
Section: Approachmentioning
confidence: 99%
“…From a human-based perspective, sentence complexity is assessed by measures of processing effort or performance in behavioral tasks. In this respect, a large part of studies has focused on reading single sentences and correlating syntactic and lexical properties with observed difficulty, being it captured by cognitive signals, such as eye-tracking metrics (Rayner, 1998;King and Just, 1991), or by explicit judgments of complexity given by readers (Brunato et al, 2018). However, models of language comprehension underline the importance of contextual cues, such as the presence of explicit cohesive devices, in building a coherent representation of a text (Kintsch et al, 1975;McNamara, 2001).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Recently, (Stajner et al, 2017) and evaluated the RPSL task with a huge dataset -the Newsela dataset, comprising 550 thousand sentences, three times greater than Wikipedia-SimpleWikipedia. (Brunato et al, 2018) evaluated the perception of complexity and agreement between annotators, while (Timm, 2018) investigated automatic sentence simplifications, using eye-tracking tools.…”
Section: Readability Prediction At Sentence Levelmentioning
confidence: 99%
“…One of the most well-known measure along these lines is the Flesch-Kincaid readability index (Kincaid et al, 1975), which combines these two measures into a global score. This approach has recently been renewed by the use of supervised statistical learning methods capable of integrating into the prediction of readability a very large number of linguistic characteristics (Schwarm and Ostendorf, 2005;Petersen and Ostendorf, 2009;Vajjala and Meurers, 2012;François and Fairon, 2012;Vajjala and Meurers, 2014;Brunato et al, 2018) aimed at capturing readability indices at the lexical, syntactic, semantic and even discursive levels. It can be argued that these enhanced feature sets are able to take into account so-called cognitive factors (Feng et al, 2009).…”
Section: Introductionmentioning
confidence: 99%