ShanghaiTech University Knowledge Management System
Human Mesh Recovery from Arbitrary Multi-view Images | |
2024-03-20 | |
状态 | 已发表 |
摘要 | Human mesh recovery from arbitrary multi-view images involves two characteristics: the arbitrary camera poses and arbitrary number of camera views. Because of the variability, designing a unified framework to tackle this task is challenging. The challenges can be summarized as the dilemma of being able to simultaneously estimate arbitrary camera poses and recover human mesh from arbitrary multi-view images while maintaining flexibility. To solve this dilemma, we propose a divide and conquer framework for Unified Human Mesh Recovery (U-HMR) from arbitrary multi-view images. In particular, U-HMR consists of a decoupled structure and two main components: camera and body decoupling (CBD), camera pose estimation (CPE), and arbitrary view fusion (AVF). As camera poses and human body mesh are independent of each other, CBD splits the estimation of them into two sub-tasks for two individual sub-networks (ie, CPE and AVF) to handle respectively, thus the two sub-tasks are disentangled. In CPE, since each camera pose is unrelated to the others, we adopt a shared MLP to process all views in a parallel way. In AVF, in order to fuse multi-view information and make the fusion operation independent of the number of views, we introduce a transformer decoder with a SMPL parameters query token to extract cross-view features for mesh recovery. To demonstrate the efficacy and flexibility of the proposed framework and effect of each component, we conduct extensive experiments on three public datasets: Human3.6M, MPI-INF-3DHP, and TotalCapture. |
关键词 | Human mesh recovery Arbitrary multi-view images Divide and conquer |
DOI | arXiv:2403.12434 |
相关网址 | 查看原文 |
出处 | Arxiv |
WOS记录号 | PPRN:88221373 |
WOS类目 | Computer Science, Software Engineering |
文献类型 | 预印本 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/372955 |
专题 | 信息科学与技术学院_硕士生 生物医学工程学院_PI研究组_沈定刚组 |
通讯作者 | Li, Xiaoben |
作者单位 | 1.ShanghaiTech Univ, Shanghai, Peoples R China 2.United Imaging Intelligence, Shanghai, Peoples R China |
推荐引用方式 GB/T 7714 | Li, Xiaoben,Meng, Mancheng,Wu, Ziyan,et al. Human Mesh Recovery from Arbitrary Multi-view Images. 2024. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。