LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment

doi:arXiv:2403.13307

	LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment
	Cong, Peishan1 ; Wang, Ziyi1 ; Dou, Zhiyang 2,5; Ren, Yiming1 ; Yin, Wei 3; Cheng, Kai 4; Sun, Yujing 5; Long, Xiaoxiao 5; Zhu, Xinge 6; Ma, Yuexin1
	2024-03-21
状态	已发表
摘要	Language-guided scene-aware human motion generation has great significance for entertainment and robotics. In response to the limitations of existing datasets, we introduce LaserHuman, a pioneering dataset engineered to revolutionize Scene-Text-to-Motion research. LaserHuman stands out with its inclusion of genuine human motions within 3D environments, unbounded free-form natural language descriptions, a blend of indoor and outdoor scenarios, and dynamic, ever-changing scenes. Diverse modalities of capture data and rich annotations present great opportunities for the research of conditional motion generation, and can also facilitate the development of real-life applications. Moreover, to generate semantically consistent and physically plausible human motions, we propose a multi-conditional diffusion model, which is simple but effective, achieving state-of-the-art performance on existing datasets.
DOI	arXiv:2403.13307
相关网址	查看原文
出处	Arxiv
WOS记录号	PPRN:88259911
WOS类目	Computer Science, Software Engineering
文献类型	预印本
条目标识符	https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/372951
专题	信息科学与技术学院_博士生信息科学与技术学院_硕士生信息科学与技术学院_本科生信息科学与技术学院_PI研究组_马月昕
通讯作者	Ma, Yuexin
作者单位	1.ShanghaiTech Univ, Shanghai, Peoples R China 2.Univ Penn, Philadelphia, PA, USA 3.Univ Adelaide, Adelaide, Australia 4.Univ Sci & Technol China, Hefei, Peoples R China 5.Univ Hong Kong, Hong Kong, Peoples R China 6.Chinese Univ Hong Kong, Hong Kong, Peoples R China
推荐引用方式 GB/T 7714	Cong, Peishan,Wang, Ziyi,Dou, Zhiyang,et al. LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment. 2024.