Development and Validation of Machine Learning–based Model for the Prediction of Malignancy in Multiple Pulmonary Nodules: Analysis from Multicentric Cohorts

Nie, Yuntao; Park, Samina; Zhang, Kai; Zhang, Yangming; Liu, Yuan; Hui, Bengang; Zhou, Lixin; Wang, Xun; Qi, Qingyi; Li, Hao; Kang, Guannan; Huang, Yuqing; Chen, Yingtai; Liu, Jiabao; Cui, Jian; Li, Mingru; Park, In Kyu; Kang, Chang Hyun; Shen, Haifeng; Yang, Yingshun; Guan, Tianwang; Zhang, Yaxiao; Yang, Fan; Kim, Young Tae; Wang, Jun

doi:10.1158/1078-0432.ccr-20-4007

Cited by 22 publications

(21 citation statements)

References 36 publications


“…Most often used and externally validated models (Brock, Mayo, PKU, VA) were selected for a network meta-analysis; the summary receiver operating characteristic (SROC) curve was plotted with the method proposed by Reitsma et al.¹⁵; and the area under the SROC curve (AUSROC) was calculated. Sensitivity and specificity of each model were also pooled using analysis of variance model,¹⁶ and diagnostic OR and superiority index were calculated.…”

Section: Methods (mentioning)

confidence: 99%

Comprehensive Analysis of Clinical Logistic and Machine Learning-Based Models for the Evaluation of Pulmonary Nodules

Zhang, Wei, Nie, et al. 2022

JTO Clinical and Research Reports

Self Cite

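The diagnostic odds ratio mentioned in the statement above can be illustrated with a minimal sketch. It assumes the standard definition (positive likelihood ratio divided by negative likelihood ratio); the function name and the input values are hypothetical and are not taken from the cited meta-analysis.

```python
def diagnostic_odds_ratio(sensitivity: float, specificity: float) -> float:
    """Diagnostic odds ratio from a single sensitivity/specificity pair.

    Standard definition: DOR = LR+ / LR-, where
    LR+ = sensitivity / (1 - specificity) and
    LR- = (1 - sensitivity) / specificity.
    """
    if not (0.0 < sensitivity < 1.0 and 0.0 < specificity < 1.0):
        # The ratio is undefined at exactly 0 or 1.
        raise ValueError("sensitivity and specificity must lie strictly between 0 and 1")
    lr_positive = sensitivity / (1.0 - specificity)
    lr_negative = (1.0 - sensitivity) / specificity
    return lr_positive / lr_negative


# Illustrative inputs only (not values reported by any of the cited models).
example_dor = diagnostic_odds_ratio(sensitivity=0.8, specificity=0.9)
```

A larger ratio indicates better discrimination, and a value of 1 means the test does not separate malignant from benign nodules.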


“…With the development of artificial intelligence technology, the machine learning models provided a better alternative for creating applicable predictive clinical diagnosis tools. In this study, we developed and validated a diagnostic nomogram model to improve the diagnostic accuracy of lung cancer based on AI tools and clinical data (3, 10, 13).…”

Section: Discussion (mentioning)

confidence: 99%

“…Low-dose computed tomography (LDCT) is the main method for public physical screening. The tumor markers assessment in hospital including carcinoembryonic antigen (CEA) and cytokeratin 19 fragment antigen 21-1 (CYFRA21-1) can improve the diagnosis rate (3). Artificial intelligence (AI) models are a step forward from automated nodule diagnosis, as they typically do not require nodule measurement or data entry.…”

Section: Introduction (mentioning)

confidence: 99%

Development and validation of a nomogram model for lung cancer based on radiomics artificial intelligence score and clinical blood test data

Zhang, Saber, et al. 2023

Front. Oncol.


Background: Artificial intelligence (AI) discrimination models using single radioactive variables in recognition algorithms of lung nodules cannot predict lung cancer accurately. Hence, we developed a clinical model that combines AI with blood test variables to predict lung cancer. Methods: Between 2018 and 2021, 584 individuals (358 patients with lung cancer and 226 individuals with lung nodules other than cancer as control) were enrolled prospectively. Machine learning algorithms including lasso regression and random forest (RF) were used to select variables from blood test data. Logistic regression analysis was used to reconfirm the features to build the nomogram model. The predictive performance was assessed by performing the receiver operating characteristic (ROC) curve analysis as well as calibration, clinical decision and impact curves. A cohort of 48 patients was used to independently validate the model. The subgroup application was analyzed by pathological diagnosis. Findings: A total of 584 patients were enrolled (358 lung cancers, 61.30%; 226 patients for the control group) to establish the model. The integrated model identified eight potential factors including carcinoembryonic antigen (CEA), AI score, Pro-Gastrin Releasing Peptide (ProGRP), cytokeratin 19 fragment antigen 21-1 (CYFRA21-1), squamous cell carcinoma antigen (SCC), indirect bilirubin (IBIL), activated partial thromboplastin time (APTT) and age. The area under the curve (AUC) of the nomogram was 0.907 (95% CI, 0.881-0.929). The decision and clinical impact curves showed good predictive accuracy of the model. An AUC of 0.844 (95% CI, 0.710-0.932) was obtained for the external validation group. Conclusion: The nomogram model integrating AI and clinical data can accurately predict lung cancer, especially for the squamous cell carcinoma subtype.


“…In recent years, there were studies differentiating benign from malignant pulmonary nodules [38,39] or predicting the invasiveness of a lesion [13,40,41] with the help of artificial intelligence (AI). Machine learning-based models were built for multiple nodules to predict lung malignancy [42], or borrowed from solitary nodules to diagnose MPLC [14]. These methods could be inappropriate in that each patient should be considered as a whole rather than targeting each lesion separately.…”

Section: Discussion (mentioning)

confidence: 99%

A pairwise radiomics algorithm - lesion pair relation estimation (PRE) model for distinguishing multiple primary lung cancer (MPLC) from intrapulmonary metastasis (IPM)

Chen, Yang, Chen³, et al. 2022

Preprint


Background: Distinguishing multiple primary lung cancer (MPLC) from intrapulmonary metastasis (IPM) is critical for their disparate treatment strategy and prognosis. This study aimed to establish a non-invasive model to make the differentiation pre-operatively. Methods: We retrospectively studied 168 patients with multiple lung cancers (307 pairs of lesions) including 118 cases for modeling and internal validation, and 50 cases for independent external validation. Radiomic features on computed tomography (CT) were extracted to calculate the absolute deviation of paired lesions. Features were then selected by correlation coefficients and random forest classifier five-fold cross-validation, based on which the lesion pair relation estimation (PRE) model was developed. A major voting strategy was used to decide diagnosis for cases with multiple pairs of lesions. Cases from another institute were included as the external validation set for the PRE model to compete with two experienced clinicians. Results: Seven radiomic features were selected for the PRE model construction. With major voting strategy, the mean area under receiver operating characteristic curve (AUC), accuracy, sensitivity, and specificity of the training vs. internal validation vs. external validation cohort to distinguish MPLC were 0.983 vs. 0.844 vs. 0.793, 0.942 vs. 0.846 vs. 0.760, 0.905 vs. 0.728 vs. 0.727, and 0.962 vs. 0.910 vs. 0.769, respectively. AUCs of the two clinicians were 0.619 and 0.580. Conclusions: The CT radiomic feature-based lesion PRE model is potentially an accurate diagnostic tool for the differentiation of MPLC and IPM, which could help with clinical decision making.
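The two mechanical steps named in this abstract, the absolute deviation of paired lesions and a voting rule across lesion pairs, can be sketched roughly as follows. This is not the authors' implementation: the function names, the input layout, the reading of "major voting" as a simple majority vote, and the tie handling (returning `None`) are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def pair_features(lesions):
    """Absolute per-feature deviation for every pair of lesions in one patient.

    `lesions` maps a lesion id to a list of radiomic feature values
    (hypothetical layout; all lists are assumed to have the same length).
    """
    return {
        (a, b): [abs(x - y) for x, y in zip(lesions[a], lesions[b])]
        for a, b in combinations(sorted(lesions), 2)
    }


def majority_vote(pair_labels):
    """Patient-level label from per-pair predictions by simple majority.

    Ties return None here; the abstract does not say how ties are resolved.
    """
    if not pair_labels:
        raise ValueError("at least one pair-level prediction is required")
    ranked = Counter(pair_labels).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None  # assumption: leave tied cases undecided
    return ranked[0][0]


# Illustrative values only, not data from the study.
features = pair_features({"L1": [1.0, 5.0], "L2": [4.0, 2.0], "L3": [1.5, 5.0]})
patient_label = majority_vote(["MPLC", "MPLC", "IPM"])
```

In a full pipeline, the pair-level labels passed to `majority_vote` would come from a classifier trained on the deviation vectors; that classifier is omitted here.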

