ShanghaiTech University Knowledge Management System
Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method | |
2024-03-28 | |
Status | Published |
Abstract | Gaze plays a crucial role in revealing human attention and intention, particularly in hand-object interaction scenarios, where it guides and synchronizes complex tasks that require precise coordination between the brain, hand, and object. Motivated by this, we introduce a novel task: Gaze-Guided Hand-Object Interaction Synthesis, with potential applications in augmented reality, virtual reality, and assistive technologies. To support this task, we present GazeHOI, the first dataset to capture simultaneous 3D modeling of gaze, hand, and object interactions. This task poses significant challenges due to the inherent sparsity and noise in gaze data, as well as the need for high consistency and physical plausibility in generating hand and object motions. To tackle these issues, we propose a stacked gaze-guided hand-object interaction diffusion model, named GHO-Diffusion. The stacked design effectively reduces the complexity of motion generation. We also introduce HOI-Manifold Guidance during the sampling stage of GHO-Diffusion, enabling fine-grained control over generated motions while maintaining the data manifold. Additionally, we propose a spatial-temporal gaze feature encoding for the diffusion condition and select diffusion results based on consistency scores between gaze-contact maps and gaze-interaction trajectories. Extensive experiments highlight the effectiveness of our method and the unique contributions of our dataset. More details are available at https://takiee.github.io/gaze-hoi/. |
DOI | arXiv:2403.16169 |
Source | arXiv |
WOS Accession Number | PPRN:88291727 |
WOS Category | Computer Science, Software Engineering |
Document Type | Preprint |
Item Identifier | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/372944 |
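Collections | School of Information Science and Technology_Master's Students; School of Information Science and Technology_PI Research Groups_Jingyi Yu Group; School of Information Science and Technology_Undergraduates; School of Information Science and Technology_PI Research Groups_Lan Xu Group; School of Information Science and Technology_PI Research Groups_Yuexin Ma; School of Information Science and Technology_PI Research Groups_Jingya Wang Group; School of Information Science and Technology_PI Research Groups_Ye Shi Group |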
Corresponding Author | Tian, Jie |
Author Affiliation | ShanghaiTech Univ, Shanghai, Peoples R China |
Recommended Citation (GB/T 7714) | Tian, Jie, Yang, Lingxiao, Ji, Ran, et al. Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method. 2024. |