| |||||||
ShanghaiTech University Knowledge Management System
3D StreetUnveiler with Semantic-Aware 2DGS | |
2024-05-28 | |
状态 | 已发表 |
摘要 | Unveiling an empty street from crowded observations captured by in-car cameras is crucial for autonomous driving. However, removing all temporary static objects, such as stopped vehicles and standing pedestrians, presents a significant challenge. Unlike object-centric 3D inpainting, which relies on thorough observation in a small scene, street scenes involve long trajectories that differ from previous 3D inpainting tasks. The camera-centric moving environment of captured videos further complicates the task due to the limited degree and time duration of object observation. To address these obstacles, we introduce StreetUnveiler to reconstruct an empty street. StreetUnveiler learns a 3D representation of the empty street from crowded observations. Our representation is based on the hard-label semantic 2D Gaussian Splatting (2DGS) for its scalability and ability to identify Gaussians to be removed. We inpaint rendered image after removing unwanted Gaussians to provide pseudo-labels and subsequently re-optimize the 2DGS. Given its temporal continuous movement, we divide the empty street scene into observed, partial-observed, and unobserved regions, which we propose to locate through a rendered alpha map. This decomposition helps us to minimize the regions that need to be inpainted. To enhance the temporal consistency of the inpainting, we introduce a novel time-reversal framework to inpaint frames in reverse order and use later frames as references for earlier frames to fully utilize the long-trajectory observations. Our experiments conducted on the street scene dataset successfully reconstructed a 3D representation of the empty street. The mesh representation of the empty street can be extracted for further applications. |
DOI | arXiv:2405.18416 |
相关网址 | 查看原文 |
出处 | Arxiv |
WOS记录号 | PPRN:89091147 |
WOS类目 | Computer Science, Software Engineering |
文献类型 | 预印本 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/387319 |
专题 | 信息科学与技术学院_硕士生 信息科学与技术学院_PI研究组_高盛华组 |
通讯作者 | Gao, Shenghua |
作者单位 | 1.ShanghaiTech Univ, Shanghai, Peoples R China 2.Fudan Univ, Shanghai, Peoples R China |
推荐引用方式 GB/T 7714 | Xu, Jingwei,Wang, Yikai,Zhao, Yiqun,et al. 3D StreetUnveiler with Semantic-Aware 2DGS. 2024. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。