Comparison of ChatGPT and Traditional Patient Education Materials for Men’s Health

Shah, Yash B.; Ghosh, Anushka; Hochberg, Aaron R.; Rapoport, Eli; Lallas, Costas D.; Shah, Mihir S.; Cohen, Seth D.

doi:10.1097/upj.0000000000000490

Cited by 23 publications

(8 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Notably, ChatGPT has the known ability to adjust reading levels of texts based on education level. 15 Further study of implementation of this AI tool into health care may consider whether responses generated by ChatGPT retain clarity, comprehensiveness, and accuracy when reading level of responses are requested to be appropriate for the general patient population. It should also be noted that responses from the two sources were of varying length, which may have contributed to complexity bias and affected scoring from participants.…”

Section: Discussionmentioning

confidence: 99%

“…A possible explanation for why patients indicated a preference for more complex material could be a welldocumented cognitive error known as "complexity bias," wherein people subconsciously demonstrate a preference for the complicated over the simple. 14 This logical fallacy may have prompted non-medical individuals to give greater credence to the more complex AI-generated material over the more readable ASRM material, although both were well above recommended readability levels. Notably, ChatGPT has the known ability to adjust reading levels of texts based on education level.…”

Section: Accepted Manuscriptmentioning

confidence: 99%

See 1 more Smart Citation

Both Patients and Plastic Surgeons Prefer Artificial Intelligence–Generated Microsurgical Information

Berry,

Fazilat,

Lavin

et al. 2024

J Reconstr Microsurg

View full text Add to dashboard Cite

Background: With the growing relevance of AI-based patient-facing information, microsurgical-specific online information provided by professional organizations was compared to that of ChatGPT and assessed for accuracy, comprehensiveness, clarity, and readability. Methods: Six plastic and reconstructive surgeons blindly assessed responses to ten microsurgery-related medical questions written either by American Society of Reconstructive Microsurgery (ASRM) or ChatGPT based on accuracy, comprehensiveness, and clarity. Surgeons were asked to choose which source provided the overall highest quality microsurgical patient-facing information. Additionally, 30 individuals with no medical background (ages 18-81, μ=49.8) were asked to determine a preference when blindly comparing materials. Readability scores were calculated, and all numerical scores were analyzed using the following six reliability formulas: Flesch-Kincaid Grade Level, Flesch-Kincaid Readability Ease, Gunning Fog Index, Simple Measure of Gobbledygook (SMOG) Index, Coleman-Liau Index, Linsear Write Formula (LWF), and Automated Readability Index. Statistical analysis of microsurgical-specific online sources was conducted utilizing paired t-tests. Results: Statistically significant differences in comprehensiveness and clarity were seen in favor of ChatGPT. Surgeons, 70.7% of the time, blindly choose ChatGPT as the source that overall provided the highest quality microsurgical patient-facing information. Non-medical individuals 55.9% of the time selected AI-generated microsurgical materials as well. Neither ChatGPT nor ASRM-generated materials were found to contain inaccuracies. Readability scores for both ChatGPT and ASRM materials were found to exceed recommended levels for patient proficiency across six readability formulas, with AI-based material scored as more complex. Conclusion: AI-generated patient-facing materials were preferred by surgeons in terms of comprehensiveness and clarity when blindly compared to online material provided by ASRM. Studied AI-generated material was not found to contain inaccuracies. Additionally, surgeons and non-medical individuals consistently indicated an overall preference for AI-generated material. A readability analysis suggested that both materials sourced from ChatGPT and ASRM surpassed recommended reading levels across six readability scores.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Accepted Manuscriptmentioning

confidence: 99%

Both Patients and Plastic Surgeons Prefer Artificial Intelligence–Generated Microsurgical Information

Berry,

Fazilat,

Lavin

et al. 2024

J Reconstr Microsurg

View full text Add to dashboard Cite

show abstract

“…Current automated simplification methods scored poorly due to grammatical errors, repetition, and inconsistencies in the autogenerated documents [ 68 ]. Artificial intelligence–derived text simplification methods may overcome these barriers by matching a document’s reading level to the readers’ needs, as shown in a study where ChatGPT was able to modify answers to men’s health condition questions to accommodate lower reading levels [ 69 , 70 ]. However, popularly used AI tools, such as ChatGPT, need considerable evaluation to minimize inaccurate information delivery and improve comprehensibility.…”

Section: Discussionmentioning

confidence: 99%

“…However, popularly used AI tools, such as ChatGPT, need considerable evaluation to minimize inaccurate information delivery and improve comprehensibility. Current studies indicate that these tools lack citations for the information they provide and cannot differentiate between low-quality and high-quality information [ 70 , 71 ].…”

Section: Discussionmentioning

confidence: 99%

Roles of Health Literacy in Relation to Social Determinants of Health and Recommendations for Informatics-Based Interventions: Systematic Review

Bindhu,

Nattam,

et al. 2024

Online J Public Health Inform

View full text Add to dashboard Cite

Background Health literacy (HL) is the ability to make informed decisions using health information. As health data and information availability increase due to online clinic notes and patient portals, it is important to understand how HL relates to social determinants of health (SDoH) and the place of informatics in mitigating disparities. Objective This systematic literature review aims to examine the role of HL in interactions with SDoH and to identify feasible HL-based interventions that address low patient understanding of health information to improve clinic note-sharing efficacy. Methods The review examined 2 databases, Scopus and PubMed, for English-language articles relating to HL and SDoH. We conducted a quantitative analysis of study characteristics and qualitative synthesis to determine the roles of HL and interventions. Results The results (n=43) were analyzed quantitatively and qualitatively for study characteristics, the role of HL, and interventions. Most articles (n=23) noted that HL was a result of SDoH, but other articles noted that it could also be a mediator for SdoH (n=6) or a modifiable SdoH (n=14) itself. Conclusions The multivariable nature of HL indicates that it could form the basis for many interventions to combat low patient understandability, including 4 interventions using informatics-based solutions. HL is a crucial, multidimensional skill in supporting patient understanding of health materials. Designing interventions aimed at improving HL or addressing poor HL in patients can help increase comprehension of health information, including the information contained in clinic notes shared with patients.

show abstract

“…Another study published by the American Urological Association compared readability between patient education materials created by urologists and responses from ChatGPT version 3.5. ChatGPT had significantly poorer readability than provider-created articles across all topics that were tested, despite being prompted to provide responses at a sixth-grade reading level (all p<0.001) [ 25 ]. It is important to identify opportunities and limitations for chatbot use in patient care settings so we can maximize its impact.…”

Section: Discussionmentioning

confidence: 99%

Assessing the Utility of ChatGPT in Simplifying Text Complexity of Patient Educational Materials

Sudharshan,

Shen,

Gupta

et al. 2024

Cureus

View full text Add to dashboard Cite

Introduction: AI chatbots are being increasingly used in healthcare settings. There is growing interest in using AI to assist in patient education. Currently, extensive healthcare information is found online but is often too complex to understand. Our objective is to determine if physicians can recommend the free version of ChatGPT version 3.5 (OpenAI, San Francisco, CA, USA) for patients to simplify text from the American Academy of Ophthalmology (AAO) in English and Spanish. This version of ChatGPT was assessed in this study due to its increased accessibility across various patient populations. Methods: Fifteen articles were chosen from AAO in both languages and simplified with ChatGPT 10 times each. The readability of original and simplified articles was assessed with the Flesch Reading Ease and Gunning Fog Index for English and Fernández Huerta, Gutiérrez, Szigriszt-Pazo, INFLESZ, and Legibilidad-µ for Spanish. Grade levels to assess readability were calculated with Flesch Kincaid Grade Level and Crawford Nivel-de-Grado. Mean, standard deviation, and two-tailed t-tests were performed to assess differences before and after simplification. Results: Average grade levels before and after simplification were as follows: English 8.43±1.17 to 8.9±2.1 (p=0.41) and Spanish 5.3±0.34 to 4.1±1.1 (p=0.0001). Spanish articles were significantly simplified per Legibilidad-µ (p=0.003). No significant difference was noted for other scales. Conclusions: The readability of AAO articles in English worsened without significance but significantly improved in Spanish. This may result from simpler syllable structures and a lesser overall vocabulary in Spanish. With increased testing, physicians can recommend ChatGPT for Spanish-speaking patients to improve health literacy.

show abstract

Comparison of ChatGPT and Traditional Patient Education Materials for Men’s Health

Cited by 23 publications

References 24 publications

Both Patients and Plastic Surgeons Prefer Artificial Intelligence–Generated Microsurgical Information

Both Patients and Plastic Surgeons Prefer Artificial Intelligence–Generated Microsurgical Information

Roles of Health Literacy in Relation to Social Determinants of Health and Recommendations for Informatics-Based Interventions: Systematic Review

Assessing the Utility of ChatGPT in Simplifying Text Complexity of Patient Educational Materials

Contact Info

Product

Resources

About