ShanghaiTech University Knowledge Management System
BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics | |
2024-04-10 | |
会议录名称 | CVPR |
ISSN | 1063-6919 |
发表状态 | 已发表 |
DOI | https://doi.org/10.1109/CVPR52733.2024.00232 |
摘要 | The recently emerging text -to -motion advances have inspired numerous attempts for convenient and interactive human motion generation. Yet, existing methods are largely limited to generating body motions only without considering the rich two -hand motions, let alone handling various conditions like body dynamics or texts. To break the data bottleneck, we propose BOTH57M, a novel multi -modal dataset for two -hand motion generation. Our dataset includes accurate motion tracking for the human body and hands and provides pair -wised finger -level hand annotations and body descriptions. We further provide a strong baseline method, BOTH2Hands, for the novel task: generating vivid two -hand motions from both implicit body dynamics and explicit text prompts. We first warm up two parallel body -to -hand and text -to -hand diffusion models and then utilize the cross -attention transformer for motion blending. Extensive experiments and cross -validations demonstrate the effectiveness of our approach and dataset for generating convincing two -hand motions from the hybrid body -and -textual conditions. Our dataset and code will be released to the community for future research, which can be found at github. |
会议地点 | Seattle, WA, USA |
会议日期 | 16-22 June 2024 |
URL | 查看原文 |
收录类别 | SCI |
语种 | 英语 |
资助项目 | National Key R&D Program of China[ |
WOS类目 | Computer Science, Software Engineering |
WOS记录号 | PPRN:86572371 |
来源库 | IEEE |
文献类型 | 会议论文 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/372919 |
专题 | 信息科学与技术学院_硕士生 信息科学与技术学院_PI研究组_虞晶怡组 信息科学与技术学院_本科生 信息科学与技术学院_博士生 信息科学与技术学院_PI研究组_许岚组 信息科学与技术学院_PI研究组_汪婧雅组 |
通讯作者 | Zhang, Wenqian |
作者单位 | ShanghaiTech Univ, Shanghai, Peoples R China |
第一作者单位 | 上海科技大学 |
通讯作者单位 | 上海科技大学 |
第一作者的第一单位 | 上海科技大学 |
推荐引用方式 GB/T 7714 | Zhang, Wenqian,Huang, Molin,Zhou, Yuxuan,et al. BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics[C],2024. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。