MVP-Human Dataset for 3-D Clothed Human Avatar Reconstruction From Multiple Frames

Xiangyu Zhu,Xiaomei Zhang,Zhiwen Chen,Qiong Cao,Yunfeng Wang,Zhen Lei,Kan Guo,Tingting Liao,Jiangjing Lyu,Stan Z Li

doi:10.1109/tbiom.2023.3276901

Xiangyu Zhu, Xiaomei Zhang + Show 8 more

https://doi.org/10.1109/tbiom.2023.3276901

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

In this paper, we consider a novel problem of reconstructing a 3D clothed human avatar from multiple frames, independent of assumptions on camera calibration, capture space, and constrained actions. We contribute a large-scale dataset, Multi-View and multi-Pose 3D human (MVP-Human in short) to help address this problem. The dataset contains 400 subjects, each of which has 15 scans in different poses and 8-view images for each pose, providing 6,000 3D scans and 48,000 images in total. In addition, a baseline method that takes multiple images as inputs, and generates a shape-with-skinning avatar in the canonical space, finished in one feed-forward pass is proposed. It first reconstructs the implicit skinning fields in a multi-level manner, and then the image features from multiple images are aligned and integrated to estimate a pixel-aligned implicit function that represents the clothed shape. With the newly collected dataset and the baseline method, it shows promising performance on 3D clothed avatar reconstruction. We release the MVP-Human dataset and the baseline method in, hoping to promote research and development in this field.

Full Text