高级检索
当前位置: 首页 > 详情页

Development and validation of a small-sample machine learning model to predict 5-year overall survival in patients with hepatocellular carcinoma

文献详情

资源类型:
WOS体系:
Pubmed体系:

收录情况: ◇ SCIE

机构: [1]Sichuan Univ, West China Hosp, Canc Ctr, Div Head & Neck Tumor Multimodal Treatment, Chengdu 610041, Sichuan, Peoples R China [2]Sichuan Univ, West China Hosp, Lab Tumor Therapy & Immunol Res, Chengdu 610041, Sichuan, Peoples R China [3]Hunan Univ, Dept Stat & Data Sci, Changsha 410000, Hunan, Peoples R China [4]Cent South Univ, Sch Life Sci, Changsha 410078, Hunan, Peoples R China [5]UAM, Inst Micro & Nanotecnol, IMN, CSIC,CEI,CNM, Isaac Newton 8, E-28706 Madrid, Spain [6]Sun Yat Sen Univ, Affiliated Hosp 1, Precis Med Inst, Guangzhou 510080, Guangdong, Peoples R China
出处:
ISSN:

关键词: HCC OS Small-sample ML Clinicopathological characteristic Hematological biomarker CTC CEC

摘要:
BackgroundEarly-onset hepatocellular carcinoma (HCC) is insidious, with characteristics of easy metastasis, high recurrence rate, and significant mortality. To address the substantial time and resource demands associated with HCC prognostic prediction, we extract meaningful insights from limited small-sample data to develop and validate a prediction model for HCC 5-year overall survival (OS) by machine learning (ML).Methods76 newly diagnosed patients with HCC were eventually enrolled between September 2018 and July 2019. The follow-up time was 1-67 months. Patients who survived for 5 years after the first surgery, were divided into a surviving group (n = 34) and a nonsurviving group (n = 42). Pathological data and related survival factors of patients were collected before treatment. The final subset of features was filtered. Prediction models for 5-year OS in patients with HCC were established by logistic regression (LR), support vector machine (SVM), decision tree classification (DTC), random forests (RF), and extreme gradient Boosting (XGBoost), respectively. Additionally, the optimal model was established after rigorous validation. The models were evaluated by values of specificity, F1 score, recall, accuracy and area under the receiver operating characteristic curve (AUC-ROC). The decision curve analysis (DCA) method was assessed the evaluation. Finally, internal and external validations were performed to further validate model' robustness.ResultsThe significant variable set, which included 22 variables, was screened. Ranking the importance of variables, the top 22 characteristic variables were as follows: the maximum diameter, presence or absence of distant metastasis, CNLC stage, ALB, age, RBC, the large size CTC, total bilirubin (TBIL), PD-L1 (-) CTC, >= Pentaploid CTC, AFP, vascular cancer thrombus and satellite nodules, WBC, CTC, BCLC stage, multiple nodules, AST, PD-L1 (-) CTC-WBC cluster, Triploid CTC, LYM, PD-L1 (-) CEC-WBC cluster and degree of cirrhosis. The AUC-ROC values for predicting the 5-year OS rate of HCC patients by the logistic regression, SVM, DTC, RF, and XGBoost models were 0.737, 0.971, 0.657, 0.741, and 0.703, respectively. Among them, the SVM model had the best performance (Accuracy = 0.987, F1 score = 0.988, Recall value = 1.000). The SVM algorithm demonstrated superior performance and stability in the internal and external validations of the model.ConclusionThe SVM model could predict the 5-year OS in HCC with good recognition ability and achieves significantly greater accuracy compared to traditional models. Diagnosis and treatment could be utilized to intervene in the risk factors in this model, thereby improving patient prognosis.

基金:
语种:
WOS:
PubmedID:
中科院(CAS)分区:
出版当年[2025]版:
大类 | 3 区 医学
小类 | 3 区 肿瘤学
最新[2025]版:
大类 | 3 区 医学
小类 | 3 区 肿瘤学
JCR分区:
出版当年[2024]版:
Q2 ONCOLOGY
最新[2024]版:
Q2 ONCOLOGY

影响因子: 最新[2024版] 最新五年平均 出版当年[2024版] 出版当年五年平均 出版前一年[2024版]

第一作者:
第一作者机构: [1]Sichuan Univ, West China Hosp, Canc Ctr, Div Head & Neck Tumor Multimodal Treatment, Chengdu 610041, Sichuan, Peoples R China [2]Sichuan Univ, West China Hosp, Lab Tumor Therapy & Immunol Res, Chengdu 610041, Sichuan, Peoples R China
通讯作者:
推荐引用方式(GB/T 7714):
APA:
MLA:

资源点击量:65780 今日访问量:0 总访问量:5151 更新日期:2025-12-01 建议使用谷歌、火狐浏览器 常见问题

版权所有©2020 四川省肿瘤医院 技术支持:重庆聚合科技有限公司 地址:成都市人民南路四段55号