| |||||||
ShanghaiTech University Knowledge Management System
LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment | |
2024-03-21 | |
状态 | 已发表 |
摘要 | Language-guided scene-aware human motion generation has great significance for entertainment and robotics. In response to the limitations of existing datasets, we introduce LaserHuman, a pioneering dataset engineered to revolutionize Scene-Text-to-Motion research. LaserHuman stands out with its inclusion of genuine human motions within 3D environments, unbounded free-form natural language descriptions, a blend of indoor and outdoor scenarios, and dynamic, ever-changing scenes. Diverse modalities of capture data and rich annotations present great opportunities for the research of conditional motion generation, and can also facilitate the development of real-life applications. Moreover, to generate semantically consistent and physically plausible human motions, we propose a multi-conditional diffusion model, which is simple but effective, achieving state-of-the-art performance on existing datasets. |
DOI | arXiv:2403.13307 |
相关网址 | 查看原文 |
出处 | Arxiv |
WOS记录号 | PPRN:88259911 |
WOS类目 | Computer Science, Software Engineering |
文献类型 | 预印本 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/372951 |
专题 | 信息科学与技术学院_博士生 信息科学与技术学院_硕士生 信息科学与技术学院_本科生 信息科学与技术学院_PI研究组_马月昕 |
通讯作者 | Ma, Yuexin |
作者单位 | 1.ShanghaiTech Univ, Shanghai, Peoples R China 2.Univ Penn, Philadelphia, PA, USA 3.Univ Adelaide, Adelaide, Australia 4.Univ Sci & Technol China, Hefei, Peoples R China 5.Univ Hong Kong, Hong Kong, Peoples R China 6.Chinese Univ Hong Kong, Hong Kong, Peoples R China |
推荐引用方式 GB/T 7714 | Cong, Peishan,Wang, Ziyi,Dou, Zhiyang,et al. LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment. 2024. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。