高级检索
当前位置: 首页 > 详情页

A CNN-transformer-based hybrid U-shape model with long-range relay for esophagus 3D CT image gross tumor volume segmentation

文献详情

资源类型:
WOS体系:
Pubmed体系:

收录情况: ◇ SCIE

机构: [1]Shandong Univ, Sch Mech Elect & Informat Engn, Weihai, Peoples R China [2]UT Southwestern Med Ctr, Dept Radiat Oncol, Dallas, TX USA [3]Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Peoples R China [4]Hangzhou Dianzi Univ, Intelligent Informat Proc Lab, Hangzhou, Peoples R China [5]Hangzhou Dianzi Univ, Sch Commun Engn, Hangzhou, Peoples R China [6]Univ Elect Sci & Technol China, Sichuan Canc Hosp & Inst, Sichuan Canc Ctr, Dept Radiat Oncol,Sch Med,Radiat Oncol Key Lab Sic, Chengdu 610000, Peoples R China [7]Shandong Univ, Suzhou Res Inst, Suzhou, Peoples R China
出处:
ISSN:

关键词: computed tomography deep learning esophageal gross tumor volume image segmentation

摘要:
Background Accurate and reliable segmentation of esophageal gross tumor volume (GTV) in computed tomography (CT) is beneficial for diagnosing and treating. However, this remains a challenging task because the esophagus has a variable shape and extensive vertical range, resulting in tumors potentially appearing at any position within it. Purpose This study introduces a novel CNN-transformer-based U-shape model (LRRM-U-TransNet) designed to enhance the segmentation accuracy of esophageal GTV. By leveraging advanced deep learning techniques, we aim to address the challenges posed by the variable shape and extensive range of the esophagus, ultimately improving diagnostic and treatment outcomes. Methods Specifically, we propose a long-range relay mechanism to converge all layer feature information by progressively passing adjacent layer feature maps in the pixel and semantic pathways. Moreover, we propose two ready-to-use blocks to implement this mechanism concretely. The Dual FastViT block interacts with feature maps from two paths to enhance feature representation capabilities. The Dual AxialViT block acts as a secondary auxiliary bottleneck to acquire global information for more precise feature map reconstruction. Results We build a new esophageal tumor dataset with 1665 real-world patient CT samples annotated by five expert radiologists and employ multiple evaluation metrics to validate our model. Results of a five-fold cross-validation on this dataset show that LRRM-U-TransNet achieves a Dice coefficient of 0.834, a Jaccard coefficient of 0.730, a Precision of 0.840, a HD95 of 3.234 mm, and a Volume Similarity of 0.143. Conclusions We propose a CNN-Transformer hybrid deep learning network to improve the segmentation effect of esophageal tumors. We utilize the local and global information between shallower and deeper layers to prevent early information loss and enhance the cross-layer communication. To validate our model, we collect a dataset composed of 1665 CT images of esophageal tumors from Sichuan Tumor Hospital. The results show that our model outperforms the state-of-the-art models. It is of great significance to improve the accuracy and clinical application of esophageal tumor segmentation.

基金:
语种:
WOS:
PubmedID:
中科院(CAS)分区:
出版当年[2025]版:
大类 | 3 区 医学
小类 | 3 区 核医学
最新[2025]版:
大类 | 3 区 医学
小类 | 3 区 核医学
JCR分区:
出版当年[2025]版:
最新[2023]版:
Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

影响因子: 最新[2023版] 最新五年平均 出版当年[2024版] 出版当年五年平均 出版前一年[2024版]

第一作者:
第一作者机构: [1]Shandong Univ, Sch Mech Elect & Informat Engn, Weihai, Peoples R China
通讯作者:
通讯机构: [3]Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Peoples R China [7]Shandong Univ, Suzhou Res Inst, Suzhou, Peoples R China
推荐引用方式(GB/T 7714):
APA:
MLA:

资源点击量:57659 今日访问量:1 总访问量:4764 更新日期:2025-05-01 建议使用谷歌、火狐浏览器 常见问题

版权所有©2020 四川省肿瘤医院 技术支持:重庆聚合科技有限公司 地址:成都市人民南路四段55号