| |||||||
ShanghaiTech University Knowledge Management System
Multi-modal Long-Short Distance Attention-based Transformer-GAN for PET Reconstruction with Auxiliary MRI | |
2025 | |
发表期刊 | IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (IF:8.3[JCR-2023],7.1[5-Year]) |
ISSN | 1558-2205 |
EISSN | 1558-2205 |
卷号 | PP期号:99 |
发表状态 | 已发表 |
DOI | 10.1109/TCSVT.2025.3545911 |
摘要 | To obtain high-quality PET scans while minimizing potential radiation hazards for patients, various GAN-based methods have been developed to reconstruct high-quality standard-count PET (SPET) images from low-count PET (LPET) ones. While recent efforts try to integrate MRI or CT to enhance reconstruction in a multi-modal way, current architectures mainly face two limitations: (1) CNN backbones or simple Transformer bottleneck layers are insufficient for robust semantic understanding, and (2) the identical strategies for multi-modal feature extraction and fusion overlook each modality’s respective importance for the reconstruction task. In this work, we propose the Multi-modal Long-Short Distance Attention-based Transformer-GAN (MLSDA-GAN), a novel network combining 3D transformer and CNN architecture for PET image reconstruction. Specifically, to extract fine-grained features with a small number of parameters, our MLSDA-GAN integrates multi-scale convolution into the embedding part of the transformer. As for our multi-modal design, given the strong correlation between LPET and SPET in structural characteristics, we treat MRI as an auxiliary modality to LPET and achieve effective multi-modal extraction and fusion strategies. These strategies include (1) a PET-specific Self-attention Extraction (PSE) block for comprehensive feature extraction of the primary LPET and (2) a Multi-modality Cross-attention Fusion (MCF) block for effective multi-modal interaction and fusion, enabling us to more efficiently model both long- and short-range relationships in the corresponding feature extraction and fusion processes. Experiments demonstrate superiority of our method quantitatively and qualitatively. Code is available at https://github.com/Aru321/MLSDA-GAN. |
关键词 | Computerized tomography - Nuclear magnetic resonance - Reconstruction (structural) Attention mechanisms - Features extraction - Features fusions - High quality - High-quality standards - Multi-modal - Multi-modal PET reconstruction - PET images - PET reconstruction - PET Scan |
URL | 查看原文 |
收录类别 | EI |
语种 | 英语 |
出版者 | Institute of Electrical and Electronics Engineers Inc. |
EI入藏号 | 20251017982285 |
EI主题词 | Radiation hazards |
EI分类号 | 1301.2.2 Nuclear Physics - 405.2 Construction Methods - 746 Imaging Techniques - 914.1 Accidents and Accident Prevention |
原始文献类型 | Article in Press |
来源库 | IEEE |
文献类型 | 期刊论文 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/493496 |
专题 | 生物医学工程学院_PI研究组_沈定刚组 |
作者单位 | 1.School of Computer Science, Sichuan University, China 2.School of Electrical and Information Engineering, University of Sydney, Australia 3.Department of Risk Controlling Research, JD.COM, China 4.School of Computer Science, Chengdu University of Information Technology, China 5.School of Biomedical Engineering & State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai, China 6.Shanghai United Imaging Intelligence Co., Ltd, Shanghai, China 7.Shanghai Clinical Research and Trial Center, Shanghai, China |
推荐引用方式 GB/T 7714 | Pinxian Zeng,Xinyi Zeng,Yan Wang,et al. Multi-modal Long-Short Distance Attention-based Transformer-GAN for PET Reconstruction with Auxiliary MRI[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY,2025,PP(99). |
APA | Pinxian Zeng.,Xinyi Zeng.,Yan Wang.,Luping Zhou.,Chen Zu.,...&Dinggang Shen.(2025).Multi-modal Long-Short Distance Attention-based Transformer-GAN for PET Reconstruction with Auxiliary MRI.IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY,PP(99). |
MLA | Pinxian Zeng,et al."Multi-modal Long-Short Distance Attention-based Transformer-GAN for PET Reconstruction with Auxiliary MRI".IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY PP.99(2025). |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。