| |||||||
ShanghaiTech University Knowledge Management System
Learning better contrastive view from radiologist's gaze | |
2025-06 | |
发表期刊 | PATTERN RECOGNITION (IF:7.5[JCR-2023],7.6[5-Year]) |
ISSN | 0031-3203 |
EISSN | 1873-5142 |
卷号 | 162 |
发表状态 | 已发表 |
DOI | 10.1016/j.patcog.2025.111350 |
摘要 | Recent advancements in self-supervised contrastive learning have shown significant benefits from utilizing a Siamese network architecture, which focuses on reducing the distances between similar (positive) pairs of data. These methods often employ random data augmentations on input images, with the expectation that these augmented views of the same image will be recognized as similar and thus, positively paired. However, this approach of random augmentation may not fully consider the semantics of the image, potentially leading to a reduction in the quality of the augmented images for contrastive learning. This challenge is particularly pronounced in the domain of medical imaging, where disease-related anomalies can be subtle and easily corrupted. In this study, we initially show that for commonly used X-ray images, traditional augmentation techniques employed in contrastive pre-training can negatively impact the performance of subsequent diagnostic or classification tasks. To address this, we introduce a novel augmentation method, i.e., FocusContrast, to learn from radiologists’ gaze during diagnosis and generate contrastive views with guidance from radiologists’ visual attention. Specifically, we track the eye movements of radiologists to understand their visual attention while diagnosing X-ray images. This understanding allows the saliency prediction model to predict where a radiologist might focus when presented with a new image, guiding the attention-aware augmentation that maintains crucial details related to diseases. As a plug-and-play and module, FocusContrast can enhance the performance of contrastive learning frameworks like SimCLR, MoCo, and BYOL. Our results show consistent improvements on datasets of knee X-rays and digital mammography, demonstrating the effectiveness of incorporating radiological expertise into the augmentation process for contrastive learning in medical imaging. © 2025 Elsevier Ltd |
关键词 | % reductions Augmented images Data augmentation Eye-tracking Human visual attention Input image Performance Random data Visual Attention X-ray image |
URL | 查看原文 |
收录类别 | SCI ; EI |
语种 | 英语 |
资助项目 | National Natural Science Foundation of China[ |
WOS研究方向 | Computer Science ; Engineering |
WOS类目 | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS记录号 | WOS:001410468800001 |
出版者 | Elsevier Ltd |
EI入藏号 | 20250417725230 |
EI主题词 | Self-supervised learning |
EI分类号 | 1101.2.1 |
原始文献类型 | Journal article (JA) |
文献类型 | 期刊论文 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/483874 |
专题 | 生物医学工程学院_PI研究组_沈定刚组 信息科学与技术学院_硕士生 生物医学工程学院_PI研究组_王乾组 |
共同第一作者 | Zhao, Zihao |
通讯作者 | Wang, Qian |
作者单位 | 1.School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China; 2.School of Biomedical Engineering & State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai, China; 3.Department of Computer Science, University of Georgia, GA, United States; 4.School of Automation, Northwestern Polytechnical University, Xi'an, China; 5.Department of Research and Development, Shanghai United Imaging Intelligence Co., Ltd., Shanghai, China; 6.Shanghai Clinical Research and Trial Center, Shanghai, China |
通讯作者单位 | 上海科技大学 |
推荐引用方式 GB/T 7714 | Wang, Sheng,Zhao, Zihao,Zhuang, Zixu,et al. Learning better contrastive view from radiologist's gaze[J]. PATTERN RECOGNITION,2025,162. |
APA | Wang, Sheng.,Zhao, Zihao.,Zhuang, Zixu.,Ouyang, Xi.,Zhang, Lichi.,...&Wang, Qian.(2025).Learning better contrastive view from radiologist's gaze.PATTERN RECOGNITION,162. |
MLA | Wang, Sheng,et al."Learning better contrastive view from radiologist's gaze".PATTERN RECOGNITION 162(2025). |
条目包含的文件 | 下载所有文件 | |||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。