2020
DOI: 10.1007/978-3-030-45439-5_3
|View full text |Cite
|
Sign up to set email alerts
|

ReadNet: A Hierarchical Transformer Framework for Web Article Readability Analysis

Abstract: Analyzing the readability of articles has been an important sociolinguistic task. Addressing this task is necessary to the automatic recommendation of appropriate articles to readers with different comprehension abilities, and it further benefits education systems, web information systems, and digital libraries. Current methods for assessing readability employ empirical measures or statistical learning techniques that are limited by their ability to characterize complex patterns such as article structures and … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(12 citation statements)
references
References 35 publications
1
11
0
Order By: Relevance
“…Table 3 shows the results of the models' operation on the test dataset (1,500 texts, about 150 texts per complexity level). The most successful (F1-score = 94.8%) is the Transformer neural network model, which is consistent with the results obtained by Meng (2020) for the English language.…”
Section: Flesch Index =supporting
confidence: 89%
See 1 more Smart Citation
“…Table 3 shows the results of the models' operation on the test dataset (1,500 texts, about 150 texts per complexity level). The most successful (F1-score = 94.8%) is the Transformer neural network model, which is consistent with the results obtained by Meng (2020) for the English language.…”
Section: Flesch Index =supporting
confidence: 89%
“…The second approach is successfully applied, for example, by Meng et al (2020). The authors propose the pre-processing of a text using the ReadNet system of linguistic characteristics, which allows the neural network to 'decode' the text and handle the classification task a little faster.…”
Section: Machine Learning and Natural Language Processing Methods For Assessing The Readability Of Textsmentioning
confidence: 99%
“…But all hybrid models beat previous SOTA results by a large margin. Notably, we achieve the near-perfect accuracy of 99% on OneStopEnglish, a massive 20.3% increase from the previous SOTA (Martinc et al, 2021) by HAN (Meng et al, 2020).…”
Section: Hybrid Model Results and Limitationsmentioning
confidence: 64%
“…Regarding the model architecture, we examined appending the handcrafted features to transformer embeddings without the use of a secondary predic- (Meng et al, 2020) hints that such a model is not robust to small datasets. ReadNet reports 52.8% accuracy on Cambridge, worse than any of our tested models (table 2, 3, 5).…”
Section: Why Not Directly Append Features?mentioning
confidence: 99%
“…Later machine learning methods for readability assessment may rely on feature extraction (e.g. Meng et al, 2020;Deutsch et al, 2020; also see for an analysis of different approaches).…”
Section: Linguistic Complexitymentioning
confidence: 99%