Generalizable and Automated Classification of TNM Stage from Pathology Reports with External Validation

Kefeli, Jenna; Tatonetti, Nicholas P.

doi:10.1101/2023.06.26.23291912

Cited by 3 publications

(2 citation statements)

References 16 publications

“…Many studies have so far used machine learning algorithms for breast cancer classification and diagnosis [25]. Other studies tried to use different machine learning methods [26], combination of machine learning and rule-based approach [27], or very recently use large language models [28] to predict TNM stages of breast cancer from pathology text reports. The objective of the study was to propose a simple machine learning based model that can automatically (and with minimal data preparation) classify clinical/surgical pathology reports of breast cancer based on TNM stages.…”

Section: Discussion (mentioning)

confidence: 99%

Using Machine Learning Algorithms in Determining the Stage of Breast Cancer from Pathology Reports

Samadzad-Qushchi, Eskandarian, Niazkhani et al. 2024

Front Health Inform

Introduction: After a cancer diagnosis, the most important step is to determine the stage and grade of the cancer. Pathology reports are the main source for cancer staging, but they do not contain all the information needed for staging. However, the text of these reports is sometimes the only available information. We were interested in knowing whether text mining methods can be used to predict staging from pathology reports alone.

Material and Methods: A total of 698 pathology reports of breast cancer cases and their TNM staging, collected from multiple centers in West Azerbaijan Province, Iran, were used for this study. After the semi-structured reports were prepared, their texts were imported into a program written in Python 3. Three machine learning algorithms (Logistic Regression, SVM, and Naïve Bayes) and a simple pipeline were used for text mining. The performance of the algorithms was evaluated in terms of accuracy, precision, recall, and F1 score.

Results: The Naïve Bayes algorithm achieved scores higher than 91% on all evaluation criteria (accuracy, precision, recall, and F1 score). It classified the reports more accurately than the other two algorithms, outperforming SVM and Logistic Regression in accuracy, recall, and F1 score. In addition, Naïve Bayes showed faster inference owing to its simplicity, and required less computation and training time.

Conclusion: We suggest using the proposed design for predicting breast cancer staging where staging is needed but no information other than pathology reports is available. This method may not be useful for clinical management of cancer patients, but it can be safely used for epidemiological estimations.

“…Fijacko et al performed multinomial classification of abstract titles using the ChatGPT-4 application programming interface (API), through a python function call with predefined prompts, demonstrating the effectiveness of LLM-based approaches in bibliometric analysis [44]. Using optical character recognition to convert pathology reports into a textual format, Kefeli and Tatonetti trained several BERT-based models for TNM stage and cancer type classification [45,46]. Fang and Wang used several BERT models pre-trained on scientific literature for multi-label topic classification, achieving F1-scores over 90% [47].…”

Section: Text Classification (mentioning)

confidence: 99%

Applications of Large Language Models in Pathology

Cheng 2024

Bioengineering

Large language models (LLMs) are transformer-based neural networks that can provide human-like responses to questions and instructions. LLMs can generate educational material, summarize text, extract structured data from free text, create reports, write programs, and potentially assist in case sign-out. LLMs combined with vision models can assist in interpreting histopathology images. LLMs have immense potential in transforming pathology practice and education, but these models are not infallible, so any artificial intelligence generated content must be verified with reputable sources. Caution must be exercised on how these models are integrated into clinical practice, as these models can produce hallucinations and incorrect results, and an over-reliance on artificial intelligence may lead to de-skilling and automation bias. This review paper provides a brief history of LLMs and highlights several use cases for LLMs in the field of pathology.

Beyond Self-consistency: Ensemble Reasoning Boosts Consistency and Accuracy of LLMs in Cancer Staging

Chang, Lucas, Lee et al. 2024

Artificial Intelligence in Medicine
