3D StreetUnveiler with Semantic-Aware 2DGS

doi:arXiv:2405.18416

	3D StreetUnveiler with Semantic-Aware 2DGS
	Xu, Jingwei1 ; Wang, Yikai 2; Zhao, Yiqun 1; Fu, Yanwei 2; Gao, Shenghua1
	2024-05-28
状态	已发表
摘要	Unveiling an empty street from crowded observations captured by in-car cameras is crucial for autonomous driving. However, removing all temporary static objects, such as stopped vehicles and standing pedestrians, presents a significant challenge. Unlike object-centric 3D inpainting, which relies on thorough observation in a small scene, street scenes involve long trajectories that differ from previous 3D inpainting tasks. The camera-centric moving environment of captured videos further complicates the task due to the limited degree and time duration of object observation. To address these obstacles, we introduce StreetUnveiler to reconstruct an empty street. StreetUnveiler learns a 3D representation of the empty street from crowded observations. Our representation is based on the hard-label semantic 2D Gaussian Splatting (2DGS) for its scalability and ability to identify Gaussians to be removed. We inpaint rendered image after removing unwanted Gaussians to provide pseudo-labels and subsequently re-optimize the 2DGS. Given its temporal continuous movement, we divide the empty street scene into observed, partial-observed, and unobserved regions, which we propose to locate through a rendered alpha map. This decomposition helps us to minimize the regions that need to be inpainted. To enhance the temporal consistency of the inpainting, we introduce a novel time-reversal framework to inpaint frames in reverse order and use later frames as references for earlier frames to fully utilize the long-trajectory observations. Our experiments conducted on the street scene dataset successfully reconstructed a 3D representation of the empty street. The mesh representation of the empty street can be extracted for further applications.Â
DOI	arXiv:2405.18416
相关网址	查看原文
出处	Arxiv
WOS记录号	PPRN:89091147
WOS类目	Computer Science, Software Engineering
文献类型	预印本
条目标识符	https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/387319
专题	信息科学与技术学院_硕士生信息科学与技术学院_PI研究组_高盛华组
通讯作者	Gao, Shenghua
作者单位	1.ShanghaiTech Univ, Shanghai, Peoples R China 2.Fudan Univ, Shanghai, Peoples R China
推荐引用方式 GB/T 7714	Xu, Jingwei,Wang, Yikai,Zhao, Yiqun,et al. 3D StreetUnveiler with Semantic-Aware 2DGS. 2024.