ShanghaiTech University Knowledge Management System
Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion | |
2024-06-14 | |
状态 | 已发表 |
摘要 | Unsupervised monocular depth estimation has received widespread attention because of its capability to train without ground truth. In real-world scenarios, the images may be blurry or noisy due to the influence of weather conditions and inherent limitations of the camera. Therefore, it is particularly important to develop a robust depth estimation model. Benefiting from the training strategies of generative networks, generative-based methods often exhibit enhanced robustness. In light of this, we employ a well-converging diffusion model among generative networks for unsupervised monocular depth estimation. Additionally, we propose a hierarchical feature-guided denoising module. This model significantly enriches the model's capacity for learning and interpreting depth distribution by fully leveraging image features to guide the denoising process. Furthermore, we explore the implicit depth within reprojection and design an implicit depth consistency loss. This loss function serves to enhance the performance of the model and ensure the scale consistency of depth within a video sequence. We conduct experiments on the KITTI, Make3D, and our self-collected SIMIT datasets. The results indicate that our approach stands out among generative-based models, while also showcasing remarkable robustness. |
DOI | arXiv:2406.09782 |
相关网址 | 查看原文 |
出处 | Arxiv |
WOS记录号 | PPRN:89335125 |
WOS类目 | Computer Science, Software Engineering |
文献类型 | 预印本 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/398569 |
专题 | 信息科学与技术学院 信息科学与技术学院_特聘教授组_张晓林组 信息科学与技术学院_硕士生 |
通讯作者 | Li, Jiamao |
作者单位 | 1.ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai 201210, Peoples R China 2.Chinese Acad Sci, Shanghai Inst Microsyst & Informat Technol, Bionic Vis Syst Lab, State Key Lab Transducer Technol, Shanghai 200050, Peoples R China 3.Univ Chinese Acad Sci, Beijing 100049, Peoples R China 4.Xiongan Inst Innovat, Xiongan 071700, Peoples R China 5.Univ Sci & Technol China, Hefei 230027, Anhui, Peoples R China |
推荐引用方式 GB/T 7714 | Liu, Runze,Zhu, Dongchen,Zhang, Guanghui,et al. Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion. 2024. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。