Abstract

Recovery of three dimensional (3D) shape and motion of non-static scenes from a monocular video sequence is important for applications like robot navigation and human computer interaction. If every point in the scene randomly moves, it is impossible to recover the non-rigid shapes. In practice, many non-rigid objects, e.g. the human face under various expressions, deform with certain structures. Their shapes can be regarded as a weighted combination of certain shape bases. Shape and motion recovery under such situations has attracted much interest. Previous work on this problem (Bregler, C., Hertzmann, A., and Biermann, H. 2000. In Proc. Int. Conf. Computer Vision and Pattern Recognition; Brand, M. 2001. In Proc. Int. Conf. Computer Vision and Pattern Recognition; Torresani, L., Yang, D., Alexander, G., and Bregler, C. 2001. In Proc. Int. Conf. Computer Vision and Pattern Recognition) utilized only orthonormality constraints on the camera rotations (rotation constraints). This paper proves that using only the rotation constraints results in ambiguous and invalid solutions. The ambiguity arises from the fact that the shape bases are not unique. An arbitrary linear transformation of the bases produces another set of eligible bases. To eliminate the ambiguity, we propose a set of novel constraints, basis constraints, which uniquely determine the shape bases. We prove that, under the weak-perspective projection model, enforcing both the basis and the rotation constraints leads to a closed-form solution to the problem of non-rigid shape and motion recovery. The accuracy and robustness of our closed-form solution is evaluated quantitatively on synthetic data and qualitatively on real video sequences.

Highlights

  • Many years of work in structure from motion have led to significant successes in recovery of 3D shapes and motion estimates from 2D monocular videos

  • While it is impossible to reconstruct the shape if the scene deforms arbitrarily, in practice, many non-rigid objects, e.g. the human face under various expressions, deform with a class of structures

  • This paper proves that, under the weak-perspective projection model, when both the basis and rotation constraints are imposed, a closed-form solution to the problem of non-rigid shape and motion recovery is achieved

Read more

Summary

Introduction

Many years of work in structure from motion have led to significant successes in recovery of 3D shapes and motion estimates from 2D monocular videos. While it is impossible to reconstruct the shape if the scene deforms arbitrarily, in practice, many non-rigid objects, e.g. the human face under various expressions, deform with a class of structures. They try to recover the non-rigid shape and motion, and the shape model This class of approaches so far has utilized only the orthonormality constraints on camera rotations. As shown in this paper, enforcing only the rotation constraints leads to ambiguous and invalid solutions These approaches cannot guarantee the desired solution. This paper proves that, under the weak-perspective projection model, when both the basis and rotation constraints are imposed, a closed-form solution to the problem of non-rigid shape and motion recovery is achieved. We develop a factorization method that applies both metric constraints to compute the closed-form solution for the non-rigid shape, motion, and shape bases

Previous Work
Problem Statement
Metric Constraints
Rotation Constraints
Why Are Rotation Constraints Not Sufficient?
Basis Constraints
A Closed-Form Solution
Performance Evaluation
Comparison with Three Previous Methods
Quantitative Evaluation on Synthetic Data
Qualitative Evaluation on Real Video Sequences
Findings
Conclusion and Discussion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.