CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection
2023-09-03
Proceedings title: ARXIV
ISSN: 1550-5499
Publication status: Published
DOI: arXiv:2309.01093
Abstract

Task driven object detection aims to detect object instances in an image that are suitable for affording a task. The challenge is that the object categories suitable for a task are too diverse to fit within the closed vocabulary of traditional object detection. Simply mapping the categories and visual features of common objects to the task cannot address this challenge. In this paper, we propose to explore fundamental affordances rather than object categories, i.e., common attributes that enable different objects to accomplish the same task. Moreover, we propose a novel multi-level chain-of-thought prompting (MLCoT) to extract affordance knowledge from large language models; it comprises multi-level reasoning steps from the task to object examples to essential visual attributes with rationales. Furthermore, to fully exploit this knowledge for object recognition and localization, we propose a knowledge-conditional detection framework, namely CoTDet, which conditions the detector on the knowledge to generate object queries and regress boxes. Experimental results demonstrate that CoTDet outperforms state-of-the-art methods consistently and significantly (+15.6 box AP and +14.8 mask AP) and can generate rationales for why the detected objects afford the task.
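
The multi-level reasoning chain described in the abstract (task, then object examples, then rationales and visual attributes) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the prompt wording, the function name `run_mlcot`, and the `llm` callable (any function that maps a prompt string to a response string) are all hypothetical, and the comma-separated response format is assumed for simplicity.

```python
from typing import Callable, Dict, List, Union


def _split_list(text: str) -> List[str]:
    """Parse a comma-separated response into a clean list of items."""
    return [item.strip() for item in text.split(",") if item.strip()]


def run_mlcot(task: str, llm: Callable[[str], str]) -> Dict[str, Union[str, List[str]]]:
    """Hypothetical three-level prompting chain: task -> examples -> attributes.

    `llm` is an assumed stand-in for any language model call; the prompt
    texts below are illustrative and are not taken from the paper.
    """
    # Level 1: from the task to concrete object examples.
    examples = _split_list(
        llm(f"List common objects a person could use to {task}, separated by commas.")
    )

    # Level 2: from object examples to rationales for why they afford the task.
    rationales = llm(
        f"For each of these objects ({', '.join(examples)}), "
        f"explain briefly why it can be used to {task}."
    )

    # Level 3: from rationales to the essential visual attributes.
    attributes = _split_list(
        llm(
            f"Given these rationales:\n{rationales}\n"
            f"Summarize the essential visual attributes an object needs to {task}, "
            "separated by commas."
        )
    )

    return {
        "task": task,
        "examples": examples,
        "rationales": rationales,
        "attributes": attributes,
    }
```

In a setup like this, the returned attributes and rationales would be the knowledge passed on to condition a detector; how CoTDet actually encodes and uses that knowledge is described in the paper itself.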

Conference location: Paris, France
Conference date: 1-6 Oct. 2023
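Funding: National Natural Science Foundation of China[
WOS category: Computer Science, Software Engineering
WOS accession number: PPRN:84731264
Source database: IEEE
Document type: Conference paper
Item identifier: https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/348025
Collection: School of Information Science and Technology
School of Information Science and Technology_PI Research Group_Jingyi Yu Group
School of Information Science and Technology_Master's Students
School of Information Science and Technology_PhD Students
School of Information Science and Technology_PI Research Group_Sibei Yang Group
Author affiliation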
ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
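First author affiliation: School of Information Science and Technology
First author's primary affiliation: School of Information Science and Technology
Recommended citation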
GB/T 7714
Tang, Jiajin,Zheng, Ge,Yu, Jingyi,et al. CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection[C],2023.