Evaluating Computer Vision, Large Language, and Genome-Wide Association Models in a Limited Sized Patient Cohort for Pre-Operative Risk Stratification in Adult Spinal Deformity Surgery

Schonfeld, Ethan; Pant, Aaradhya; Shah, Aaryan; Sadeghzadeh, Sina; Pangal, Dhiraj; Rodrigues, Adrian; Yoo, Kelly; Marianayagam, Neelan; Haider, Ghani; Veeravagu, Anand

doi:10.3390/jcm13030656

Cited by 5 publications

References 31 publications

An overview of the effects of algorithm use on judgmental biases affecting forecasting

Chacon, Kaufmann. 2024. International Journal of Forecasting


Qualitative metrics from the biomedical literature for evaluating large language models in clinical decision-making: a narrative review

Ho, Tian, Ayers et al. 2024. BMC Med Inform Decis Mak


Background: The large language models (LLMs), most notably ChatGPT, released since November 30, 2022, have prompted shifting attention to their use in medicine, particularly for supporting clinical decision-making. However, there is little consensus in the medical community on how LLM performance in clinical contexts should be evaluated.

Methods: We performed a literature review of PubMed to identify publications between December 1, 2022, and April 1, 2024, that discussed assessments of LLM-generated diagnoses or treatment plans.

Results: We selected 108 relevant articles from PubMed for analysis. The most frequently used LLMs were GPT-3.5, GPT-4, Bard, LLaMa/Alpaca-based models, and Bing Chat. The five most frequently used criteria for scoring LLM outputs were “accuracy”, “completeness”, “appropriateness”, “insight”, and “consistency”.

Conclusions: The most frequently used criteria for defining high-quality LLMs have been consistently selected by researchers over the past 1.5 years. We identified a high degree of variation in how studies reported their findings and assessed LLM performance. Standardized reporting of qualitative evaluation metrics that assess the quality of LLM outputs can be developed to facilitate research studies on LLMs in healthcare.

Supplementary Information: The online version contains supplementary material available at 10.1186/s12911-024-02757-z.


Machine learning applications in adult spinal deformity corrective surgery: a narrative review

Toossi, Jerry. 2024. Art Int Surg


Adult spinal deformity (ASD) poses significant challenges in spinal surgery, requiring precise planning and execution for successful correction. Optimizing outcomes and reducing the high complication rates of ASD surgery are further challenges facing spinal deformity surgeons. The advent of machine learning (ML) has revolutionized various aspects of healthcare, including spinal surgery. This review provides a comprehensive overview of the current state of ML applications in spinal deformity corrective surgery, highlighting its potential benefits and challenges.
