A Smart Interactive Camera Robot Based on Large Language Models
2023-12-09
Proceedings title: 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO)
Publication status: Published
DOI: 10.1109/ROBIO58561.2023.10354952
Abstract: The emergence of large language models (LLMs) has paved the way for advancing robotics capabilities, especially in intricate tasks that demand nuanced comprehension and precision. In this context, this paper introduces a novel interactive camera robot that harnesses LLMs to enhance human-robot interaction and optimize robot control. Specifically, an innovative technique that leverages the language understanding capabilities of LLMs to plan camera movement trajectories and waypoints is presented. In this study, the geometric relationships among the objects under capture are employed to plan the control strategy. Accordingly, this approach not only empowers sophisticated camera parameter manipulation and color adjustments but also fosters a natural and efficient human-robot interaction. Lots of experiments on real robots are conducted to evaluate the effectiveness of the proposed method under various scenarios. The results reveal robust performance across crucial measures, affirming the substantial potential of LLMs in elevating camera robot control and interaction experience. Videos of our experiments are available at https://youtu.be/zP-sTZHvXe4. © 2023 IEEE.
Keywords: Vocabulary; Tracking; Robot vision systems; Robot control; Human-robot interaction; Cameras; Trajectory
Conference name: 2023 IEEE International Conference on Robotics and Biomimetics, ROBIO 2023
Conference location: Koh Samui, Thailand
Conference dates: 4-9 Dec. 2023
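Indexed by: EI
Language: English
Publisher: Institute of Electrical and Electronics Engineers Inc.
EI accession number: 20240315404603
Original document type: Conference article (CA)
Source database: IEEE
Document type: Conference paper
Item identifier: https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/349510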
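Collection: School of Information Science and Technology
School of Information Science and Technology_PI Research Groups_Bai Weibang (白卫邦) Group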
Corresponding author: Zhu, Guo-Niu
Author affiliations:
1. Fudan University, School of Information Science and Technology, Shanghai 200433, China
2. Fudan University, Academy for Engineering and Technology, Shanghai 200433, China
3. ShanghaiTech University, School of Information Science and Technology, Shanghai 201210, China
Recommended citation:
GB/T 7714
Bao, Zeyu,Zhu, Guo-Niu,Ding, Wenchao,et al. A Smart Interactive Camera Robot Based on Large Language Models[C]:Institute of Electrical and Electronics Engineers Inc.,2023.