Bridging Language and Geometric Primitives for Zero-shot Point Cloud Segmentation

doi:10.1145/3581783.3612409

	Bridging Language and Geometric Primitives for Zero-shot Point Cloud Segmentation
	Chen, Runnan 1; Zhu, Xinge 2; Chen, Nenglun 1; Li, Wei 3; Ma, Yuexin4 ; Yang, Ruigang 3; Wang, Wenping 5
	2023-10-26
会议录名称	MM 2023 - PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA
页码	5380-5388
发表状态	已发表
DOI	10.1145/3581783.3612409
摘要	We investigate transductive zero-shot point cloud semantic segmentation, where the network is trained on seen objects and able to segment unseen objects. The 3D geometric elements are essential cues to imply a novel 3D object type. However, previous methods neglect the fine-grained relationship between the language and the 3D geometric elements. To this end, we propose a novel framework to learn the geometric primitives shared in seen and unseen categories' objects and employ a fine-grained alignment between language and the learned geometric primitives. Therefore, guided by language, the network recognizes the novel objects represented with geometric primitives. Specifically, we formulate a novel point visual representation, the similarity vector of the point's feature to the learnable prototypes, where the prototypes automatically encode geometric primitives via back-propagation. Besides, we propose a novel Unknown-aware InfoNCE Loss to fine-grained align the visual representation with language. Extensive experiments show that our method significantly outperforms other state-of-the-art methods in the harmonic mean-intersection-over-union (hIoU), with the improvement of 17.8%, 30.4%, 9.2% and 7.9% on S3DIS, ScanNet, SemanticKITTI and nuScenes datasets, respectively. Codes are available1 https://github.com/runnanchen/Zero-Shot-Point-Cloud-Segmentation. © 2023 ACM.
会议录编者/会议主办者	ACM SIGMM
关键词	Geometry Semantic Segmentation Semantics Visual languages Zero-shot learning 3D object Fine grained Fine-grained relationships Geometric elements Geometric primitives Learn+ Point cloud segmentation Point-clouds Semantic segmentation Visual representations
会议名称	31st ACM International Conference on Multimedia, MM 2023
会议地点	Ottawa, ON, Canada
会议日期	October 29, 2023 - November 3, 2023
收录类别	EI
语种	英语
出版者	Association for Computing Machinery, Inc
EI入藏号	20235015224744
EI主题词	Backpropagation
EI分类号	723.1.1 Computer Programming Languages ; 723.4 Artificial Intelligence ; 921 Mathematics
原始文献类型	Conference article (CA)
引用统计	正在获取...
文献类型	会议论文
条目标识符	https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/348711
专题	信息科学与技术学院_PI研究组_马月昕
作者单位	1.The University of Hong Kong, Hong Kong 2.The Chinese University of Hong Kong, Hong Kong 3.Inceptio, United States 4.ShanghaiTech University, China 5.Texas A&m University, United States
推荐引用方式 GB/T 7714	Chen, Runnan,Zhu, Xinge,Chen, Nenglun,et al. Bridging Language and Geometric Primitives for Zero-shot Point Cloud Segmentation[C]//ACM SIGMM:Association for Computing Machinery, Inc,2023:5380-5388.