Predicting the pathological invasiveness in patients with a solitary pulmonary nodule via Shapley additive explanations interpretation of a tree-based machine learning radiomics model: a multicenter study
Background: Radiomics models could help assess the benign and malignant invasiveness and prognosis of pulmonary nodules. However, the lack of interpretability limits application of these models. We thus aimed to construct and validate an interpretable and generalized computed tomography (CT) radiomics model to evaluate the pathological invasiveness in patients with a solitary pulmonary nodule in order to improve the management of these patients.Methods: We retrospectively enrolled 248 patients with CT-diagnosed solitary pulmonary nodules. Radiomic features were extracted from nodular region and perinodular regions of 3 and 5 mm. After coarse to-fine feature selection, the radiomics score (radscore) was calculated using the least absolute shrinkage and selection operator logistic method. Univariate and multivariate logistic regression analyses were performed to determine the invasiveness-related clinicoradiological factors. The clinical-radiomics model was then constructed using the logistic and extreme gradient boosting (XGBoost) algorithms. The Shapley additive explanations (SHAP) method was then used to explain the contributions of the features. After removing batch effects with the ComBat algorithm, we assessed the generalization of the explainable clinical-radiomics model in two independent external validation cohorts (n=147 and n=149).Results: The clinical-radiomic XGBoost model integrating the radscore, CT value, nodule length, and crescent sign demonstrated better predictive performance than did the clinical-radiomics logistic model in assessing pulmonary nodule invasiveness, with an area under the receiver operating characteristic (ROC) curve (AUC) of 0.889 [95% confidence interval (CI), 0.848-0.927] in the training cohort. The SHAP algorithm illustrates the contribution of each feature in the final model. The specific model decision process was visualized using a tree-based decision heatmap. Satisfactory generalization performance was shown with AUCs of 0.889 (95% CI, 0.823-0.942) and 0.915 (95% CI, 0.851-0.963) in the two external validation cohorts.Conclusions: An interpretable and generalized clinical-radiomics model for predicting pulmonary nodule invasibility was constructed to help clinicians determine the invasiveness of pulmonary nodules and devise assessment strategies in an easily understandable manner.
基金:
Guangdong Medical Science and Technology Research Fund [A2021483]; Research Launch Project of Shunde Hospital of Southern Medical University [SRSP2021021]; Medical Health Science and Technology Project Zhejiang Provincial Health Commission [2023ky338]; Science and Technology Planning Project of Foshan [2220001005383]
第一作者机构:[1]Southern Med Univ, Shunde Hosp, Peoples Hosp Shunde 1, Dept Radiol, 1 Jiazi Rd, Foshan 528308, Peoples R China
共同第一作者:
通讯作者:
通讯机构:[1]Southern Med Univ, Shunde Hosp, Peoples Hosp Shunde 1, Dept Radiol, 1 Jiazi Rd, Foshan 528308, Peoples R China[6]Lecong Hosp Shunde, Dept Radiol, 45 Lecong Ave, Foshan 528315, Peoples R China[*1]Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde), No. 1 Jiazi Road, Lunjiao, Shunde District, Foshan 528308, China.[*2]Department of Radiology, Lecong Hospital of Shunde, No. 45 Lecong Avenue, Shunde District, Foshan 528315, China.[*3]Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde), No. 1 Jiazi Road, Lunjiao, Shunde District, Foshan 528308, China
推荐引用方式(GB/T 7714):
Zhang Rong,Hong Minping,Cai Hongjie,et al.Predicting the pathological invasiveness in patients with a solitary pulmonary nodule via Shapley additive explanations interpretation of a tree-based machine learning radiomics model: a multicenter study[J].QUANTITATIVE IMAGING IN MEDICINE AND SURGERY.2023,13(12):7828-+.doi:10.21037/qims-23-615.
APA:
Zhang, Rong,Hong, Minping,Cai, Hongjie,Liang, Yanting,Chen, Xinjie...&Hu, Qiugen.(2023).Predicting the pathological invasiveness in patients with a solitary pulmonary nodule via Shapley additive explanations interpretation of a tree-based machine learning radiomics model: a multicenter study.QUANTITATIVE IMAGING IN MEDICINE AND SURGERY,13,(12)
MLA:
Zhang, Rong,et al."Predicting the pathological invasiveness in patients with a solitary pulmonary nodule via Shapley additive explanations interpretation of a tree-based machine learning radiomics model: a multicenter study".QUANTITATIVE IMAGING IN MEDICINE AND SURGERY 13..12(2023):7828-+