Breast cancer is the most common type of cancer worldwide. Alarmingly, approximately 30% of breast cancer cases result in disease recurrence at distant organs after treatment. Distant recurrence is more common in some subtypes such as invasive breast carcinoma (IBC). While clinicians have utilized several clinicopathological measurements to predict distant recurrences in IBC, no studies have predicted distant recurrences by combining clinicopathological evaluations of IBC tumors pre- and post-therapy with machine learning (ML) models. The goal of our study was to determine whether classification-based ML techniques could predict distant recurrences in IBC patients using key clinicopathological measurements, including pathological staging of the tumor and surrounding lymph nodes assessed both pre- and post-neoadjuvant therapy, response to therapy via standard-of-care imaging, and binary status of adjuvant therapy administered to patients. We trained and tested four clinicopathological ML models using a dataset (144 and 17 patients for training and testing, respectively) from Duke University and validated the best-performing model using an external dataset (8 patients) from Dartmouth Hitchcock Medical Center. The random forest model performed better than the C-support vector classifier, multilayer perceptron, and logistic regression models, yielding AUC values of 1.0 in the testing set and 0.75 in the validation set (p < 0.002) across both institutions, thereby demonstrating the cross-institutional portability and validity of ML models in the field of clinical research in cancer. The top-ranking clinicopathological measurement impacting the prediction of distant recurrences in IBC were identified to be tumor response to neoadjuvant therapy as evaluated via SOC imaging and pathology, which included tumor as well as node staging.
Breast cancer ranks second in the most common cancer in women worldwide with 30% of cases resulting into recurrence of the disease at distant organs post the treatment. While clinicians have utilized several clinicopathological measurements for prediction of distant recurrences in invasive breast carcinoma (IBC), none of those studies have showcased the potential of combining clinicopathological evaluations of IBC tumors pre and post therapies using machine learning (ML) or artificially intelligent (AI) models to predict the distant recurrence of the disease in respective patients. The goal of our study was to determine whether classification-based ML/AI techniques can predict distant recurrences in IBC patients using key clinicopathological measurements that includes pathological staging of tumor and surrounding lymph nodes deemed both pre- and post-neoadjuvant therapy, imaging-based therapy responses, and the status of adjuvant therapy administered to patients. We trained and tested clinicopathological ML/AI model using dataset from Duke University and validated it using external dataset from Dartmouth Hitchcock Medical Center (DHMC). Random Forest (RF) model performed best compared to C-Support Vector Classifier (SVC) and Multi-Layer Perceptron (MLP) yielding AUC ranging 0.75-1.0 (p<0.002) across both the institutions, thereby demonstrating the cross-institutional portability and validity of ML/AI models in the field of clinical research in cancer.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.