See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data
2023
会议录名称2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV)
ISSN1550-5499
页码21617-21627
发表状态已发表
DOI10.1109/ICCV51070.2023.01981
摘要Zero-shot point cloud segmentation aims to make deep models capable of recognizing novel objects in point cloud that are unseen in the training phase. Recent trends favor the pipeline which transfers knowledge from seen classes with labels to unseen classes without labels. They typically align visual features with semantic features obtained from word embedding by the supervision of seen classes' annotations. However, point cloud contains limited information to fully match with semantic features. In fact, the rich appearance information of images is a natural complement to the textureless point cloud, which is not well explored in previous literature. Motivated by this, we propose a novel multi-modal zero-shot learning method to better utilize the complementary information of point clouds and images for more accurate visual-semantic alignment. Extensive experiments are performed in two popular benchmarks, i.e., SemanticKITTI and nuScenes, and our method outperforms current SOTA methods with 52% and 49% improvement on average for unseen class mIoU, respectively. © 2023 IEEE.
关键词Point cloud compression Training Visualization Computer vision Zero-shot learning Semantic segmentation Semantics
会议名称2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023
会议地点Paris, France
会议日期1-6 Oct. 2023
URL查看原文
收录类别EI
语种英语
出版者Institute of Electrical and Electronics Engineers Inc.
EI入藏号20241215793211
原始文献类型Conference article (CA)
来源库IEEE
引用统计
被引频次:14[WOS]   [WOS记录]     [WOS相关记录]
文献类型会议论文
条目标识符https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/354924
专题信息科学与技术学院_PI研究组_马月昕
信息科学与技术学院_硕士生
共同第一作者Qi Jiang
通讯作者Yuexin Ma
作者单位
1.ShanghaiTech University
2.The University of Hong Kong
3.Shanghai AI Laboratory
4.The Chinese University of Hong Kong
第一作者单位上海科技大学
通讯作者单位上海科技大学
第一作者的第一单位上海科技大学
推荐引用方式
GB/T 7714
Yuhang Lu,Qi Jiang,Runnan Chen,et al. See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data[C]:Institute of Electrical and Electronics Engineers Inc.,2023:21617-21627.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Yuhang Lu]的文章
[Qi Jiang]的文章
[Runnan Chen]的文章
百度学术
百度学术中相似的文章
[Yuhang Lu]的文章
[Qi Jiang]的文章
[Runnan Chen]的文章
必应学术
必应学术中相似的文章
[Yuhang Lu]的文章
[Qi Jiang]的文章
[Runnan Chen]的文章
相关权益政策
暂无数据
收藏/分享
文件名: 10.1109@ICCV51070.2023.01981.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。