高级检索
当前位置: 首页 > 详情页

Explainable Machine Learning Model to Prediction EGFR Mutation in Lung Cancer

文献详情

资源类型:
Pubmed体系:
机构: [1]Department of Respiratory and Critical Care Medicine, West China Hospital, Sichuan University, Chengdu, China. [2]Institute of Respiratory Health Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu, China. [3]Precision Medicine Center, Precision Medicine Key Laboratory of Sichuan Province, West China Hospital, Sichuan University, Chengdu, China. [4]The Research Units of West China, Chinses Academy of Medical Sciences, West China Hospital, Chengdu, China.
出处:
ISSN:

关键词: EGFR mutation lung cancer prediction machine learning SHAP value

摘要:
The aim of this study is to determine whether the clinical features including blood markers can establish an explainable machine learning model to predict epidermal growth factor receptor (EGFR) mutation in lung cancer.We retrospectively analyzed 7,413 patients with lung adenocarcinoma (LA) diagnosed by gene sequencing in West China Hospital of the Sichuan University from April 2015 to June 2019. The machine learning algorithms (MLAs) included logistic regression (LR), random forest (RF), LightGBM, support vector machine (SVM), multi-layer perceptron (MLP), extreme gradient boosting (XGBoost), and decision tree (DT). Demographic characteristics, personal history, and blood markers were taken into. The area under the receiver operating characteristic curve (AUC) and SHapley Additive exPlanation (SHAP) value were used to explain the prediction models.Of the 7,413 patients with LA (47.6%), 3,527 were identified with EGFR mutation; RF achieved greatest performance in predicting EGFR mutation AUC [0.771, 95% confidence interval (CI): 0.770, 0.772], which was like XGBoost with AUC (0.740, 95% CI: 0.739, 0.741). The five most influential features were smoking consumption, sex, cholesterol, age, and albumin globulin ratio. The SHAP summary and dependence plot have been used to explain the affection of the 12 features to this model and how a single feature influences the output, respectively.We established EGFR mutation prediction models by MLAs and revealed that the RF was preferred, AUC (0.771, 95% CI: 0.770, 0.772), which was better than the traditional models. Therefore, the artificial intelligence-based MLA predicting model may become a practical tool to guide in diagnosis and therapy of LA.Copyright © 2022 Yang, Xiong, Wang and Li.

基金:
语种:
PubmedID:
中科院(CAS)分区:
出版当年[2022]版:
大类 | 3 区 医学
小类 | 3 区 肿瘤学
最新[2023]版:
大类 | 3 区 医学
小类 | 3 区 肿瘤学
第一作者:
第一作者机构: [1]Department of Respiratory and Critical Care Medicine, West China Hospital, Sichuan University, Chengdu, China.
共同第一作者:
通讯作者:
通讯机构: [1]Department of Respiratory and Critical Care Medicine, West China Hospital, Sichuan University, Chengdu, China. [2]Institute of Respiratory Health Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu, China. [3]Precision Medicine Center, Precision Medicine Key Laboratory of Sichuan Province, West China Hospital, Sichuan University, Chengdu, China. [4]The Research Units of West China, Chinses Academy of Medical Sciences, West China Hospital, Chengdu, China.
推荐引用方式(GB/T 7714):
APA:
MLA:

资源点击量:53080 今日访问量:0 总访问量:4588 更新日期:2025-01-01 建议使用谷歌、火狐浏览器 常见问题

版权所有©2020 四川省肿瘤医院 技术支持:重庆聚合科技有限公司 地址:成都市人民南路四段55号