Literacy, Pregnancy and Potential Oral Health Changes: The Internet and Readability Levels

Wiener, R. Constance; Wiener-Pla, Regina

doi:10.1007/s10995-013-1290-1

Cited by 21 publications

(17 citation statements)

References 20 publications


“…[3] have investigated methods for estimating user proficiency and readability of results, as well as for re-ranking results according to this information. In health web search, accounting for the readability of the retrieved information is a core requirement to effectively support users (see for example [16]). Health consumers may have a limited understanding of the medical terminology and processes, and thus they should be shown text that is simple to understand and limits expert terminology.…”

Section: Introduction (mentioning)

confidence: 99%

“…While recent research has proposed sophisticated readability estimation methods [3,7], often tailored to specific domains [17], traditional readability measures such as the Automated Readability Index and the Gunning Fog Index are extensively used for assessing information on the web (see for example [16,18]). These long-established readability measures consider the surface level of the text contained in web pages, that is, the wording and the syntax of sentences.…”

Section: Introduction (mentioning)

confidence: 99%


The Influence of Pre-processing on the Estimation of Readability of Web Documents

Palotti

Zuccon

Hanbury

2015

Proceedings of the 24th ACM International on Conference on Information and Knowledge Management


This paper investigates the effect that text pre-processing approaches have on the estimation of the readability of web pages. Readability has been highlighted as an important aspect of web search result personalisation in previous work. The most widely used text readability measures rely on surface-level characteristics of text, such as the length of words and sentences. We demonstrate that different tools for extracting text from web pages lead to very different estimations of readability. This has an important implication for search engines because search result personalisation strategies that consider users' reading ability may fail if incorrect text readability estimations are computed.
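As a rough illustration of the point made in this abstract, the sketch below computes the Automated Readability Index (one of the surface-level measures named in the citation statements above) on the same passage with and without leftover navigation text. The tokenisation rules, the function name, and the sample strings are assumptions made for this example; they are not the pipeline or the data used by the cited authors.

```python
import re


def automated_readability_index(text: str) -> float:
    """Standard ARI formula: 4.71 * (chars / words) + 0.5 * (words / sentences) - 21.43.

    Assumed tokenisation (illustrative only): words are whitespace-separated
    tokens, characters are alphanumeric characters, and each run of '.', '!'
    or '?' ends one sentence.
    """
    words = text.split()
    if not words:
        raise ValueError("text contains no words")
    characters = sum(ch.isalnum() for ch in text)
    sentences = max(1, len(re.findall(r"[.!?]+", text)))
    return 4.71 * characters / len(words) + 0.5 * len(words) / sentences - 21.43


# Hypothetical page content: the same passage, once cleanly extracted and once
# with menu labels that a naive HTML-to-text step left in front of it.
clean_text = "Brush your teeth twice a day. See your dentist during pregnancy."
noisy_text = "Home About Contact Login " + clean_text

clean_score = automated_readability_index(clean_text)
noisy_score = automated_readability_index(noisy_text)

print(f"clean extraction: {clean_score:.2f}")
print(f"noisy extraction: {noisy_score:.2f}")
```

Because the menu labels carry no sentence punctuation, they are absorbed into the first sentence and raise the words-per-sentence term, so the same passage receives a higher estimated grade level.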


“…Zheng and Yu have reported on the readability of electronic health records compared to Wikipedia pages related to diabetes and found that readability measures often do not align with user ratings of readability [22]. A common finding of these studies is that, in general, health content available on Web pages is often hard to understand by the general public; this includes content that is retrieved in top-ranked positions by current commercial search engines [4,5,6,7,8,9].…”

Section: Related Work (mentioning)

confidence: 99%

“…These measures generally rely on surface-level characteristics of text, such as characters, syllables and word counts (missing citation). While these measures have been widely used in studies investigating the understandability of health content retrieved by search engines (e.g., [4,5,6,7,8,9,18,21]), our preliminary work found that these measures are heavily affected by the methods used to extract text from the HTML source [13]. We were able to identify specific settings of an HTML preprocessing pipeline that provided consistent estimates, but due to the lack of human assessments, we were not able to investigate how well each HTML preprocessing pipeline correlated with human assessments.…”

Section: Related Work (mentioning)

confidence: 99%


Consumer Health Search on the Web: Study of Web Page Understandability and Its Integration in Ranking Algorithms (Preprint)

Palotti

Zuccon

Hanbury

2018

Preprint


Background: Understandability plays a key role in ensuring that people accessing health information are capable of gaining insights that can assist them with their health concerns and choices. Access to unclear or misleading information has been shown to negatively impact the health decisions of the general public. Objective: We investigated methods to estimate the understandability of health Web pages and used these to improve the retrieval of information for people seeking health advice on the Web. Methods: Our investigation considered methods to automatically estimate the understandability of health information in Web pages, and it provided a thorough evaluation of these methods using human assessments as well as an analysis of preprocessing factors affecting understandability estimations, and associated pitfalls. Furthermore, lessons learnt for estimating Web page understandability were applied to the construction of retrieval methods with specific attention to retrieving information understandable by the general public. Results: We found that machine learning techniques were more suitable to estimate health Web page understandability than traditional readability formulae, which are often used as guidelines and benchmarks by health information providers on the Web (larger difference found for Pearson correlation of .602 using Gradient Boosting regressor compared to .438 using SMOG Index with CLEF 2015 collection). Learning to rank effectively exploited these estimates to provide the general public with more understandable search results (H_RBP reached 29.20, 22% higher than a BM25 baseline and 13% higher than the best system at CLEF 2016, both P≤.001).


Understandability Biased Evaluation for Information Retrieval

Zuccon

2016

Lecture Notes in Computer Science
