Abstract

Reconstructing interacting hands from monocular RGB data is challenging: it involves many interfering factors, such as self- and mutual occlusion and similar hand textures. Previous works leverage information only from a single RGB image and do not model the physically plausible relation between the two hands, which leads to inferior reconstruction results. In this work, we explicitly exploit spatio-temporal information to achieve better interacting-hand reconstruction. On the one hand, we leverage temporal context to complement the insufficient information provided by a single frame, designing a novel temporal framework with a temporal constraint that enforces smooth interacting-hand motion. On the other hand, we propose an interpenetration detection module that produces kinematically plausible interacting hands free of physical collisions. Extensive experiments validate the effectiveness of the proposed framework, which achieves new state-of-the-art performance on public benchmarks.
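
To make the two key ingredients concrete, below is a minimal sketch (not the authors' exact formulation) of how a temporal smoothness constraint and an interpenetration penalty are commonly realized. The sphere-proxy approximation, the function names, and all parameters are illustrative assumptions, not details taken from the paper.

```python
import torch

def temporal_smoothness_loss(poses: torch.Tensor) -> torch.Tensor:
    """First-order smoothness term: penalize frame-to-frame pose change.

    poses: (T, D) tensor of per-frame hand pose parameters.
    (Illustrative; the paper's temporal constraint may differ.)
    """
    # Difference between consecutive frames; small values mean smooth motion.
    velocity = poses[1:] - poses[:-1]
    return (velocity ** 2).sum(dim=-1).mean()

def interpenetration_loss(centers_l: torch.Tensor, radii_l: torch.Tensor,
                          centers_r: torch.Tensor, radii_r: torch.Tensor) -> torch.Tensor:
    """Collision penalty between two hands using sphere proxies.

    Each hand is approximated by spheres (e.g., centered on joints);
    overlapping spheres from different hands are penalized.
    centers_*: (N, 3) sphere centers; radii_*: (N,) sphere radii.
    (Assumption: a sphere-proxy scheme, a common stand-in for mesh collision tests.)
    """
    # Pairwise distances between left- and right-hand sphere centers.
    dist = torch.cdist(centers_l, centers_r)           # (N_l, N_r)
    min_sep = radii_l[:, None] + radii_r[None, :]      # required separation
    # Penetration depth where spheres overlap, zero otherwise.
    penetration = torch.clamp(min_sep - dist, min=0.0)
    return penetration.sum()
```

In a typical training loop, both terms would be added to the per-frame reconstruction loss with small weighting coefficients, so that smoothness and collision-freeness act as soft regularizers rather than hard constraints.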
