ShanghaiTech University Knowledge Management System
A Smart Interactive Camera Robot Based on Large Language Models | |
2023-12-09 | |
Proceedings title | 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO)
Publication status | Published |
DOI | 10.1109/ROBIO58561.2023.10354952 |
Abstract | The emergence of large language models (LLMs) has paved the way for advancing robotics capabilities, especially in intricate tasks that demand nuanced comprehension and precision. In this context, this paper introduces a novel interactive camera robot that harnesses LLMs to enhance human-robot interaction and optimize robot control. Specifically, an innovative technique that leverages the language understanding capabilities of LLMs to plan camera movement trajectories and waypoints is presented. In this study, the geometric relationships among the objects under capture are employed to plan the control strategy. Accordingly, this approach not only empowers sophisticated camera parameter manipulation and color adjustments but also fosters a natural and efficient human-robot interaction. Lots of experiments on real robots are conducted to evaluate the effectiveness of the proposed method under various scenarios. The results reveal robust performance across crucial measures, affirming the substantial potential of LLMs in elevating camera robot control and interaction experience. Videos of our experiments are available at https://youtu.be/zP-sTZHvXe4. © 2023 IEEE. |
Keywords | Vocabulary; Tracking; Robot vision systems; Robot control; Human-robot interaction; Cameras; Trajectory |
Conference name | 2023 IEEE International Conference on Robotics and Biomimetics, ROBIO 2023 |
Conference location | Koh Samui, Thailand |
Conference dates | 4-9 Dec. 2023 |
Indexed by | EI |
Language | English |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
EI accession number | 20240315404603 |
Original document type | Conference article (CA) |
Source database | IEEE |
Document type | Conference paper |
Item identifier | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/349510 |
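Collection | School of Information Science and Technology; School of Information Science and Technology_PI Research Groups_Bai Weibang Group |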
Corresponding author | Zhu, Guo-Niu |
Author affiliations | 1. Fudan University, School of Information Science and Technology, Shanghai 200433, China; 2. Fudan University, Academy for Engineering and Technology, Shanghai 200433, China; 3. ShanghaiTech University, School of Information Science and Technology, Shanghai 201210, China |
Recommended citation (GB/T 7714) | Bao, Zeyu, Zhu, Guo-Niu, Ding, Wenchao, et al. A Smart Interactive Camera Robot Based on Large Language Models[C]: Institute of Electrical and Electronics Engineers Inc., 2023. |