Avatar Creation Research Articles

Facial mesh tracking enables the production of topologically consistent 3D facial meshes from stereo video input captured by calibrated cameras. This technology is an integral part of many digital human applications, such as personalized avatar creation, audio-driven 3D facial animation, and talking face video generation. Currently, most facial mesh tracking methods are built on computer graphics techniques, which involve complex procedures and often necessitate human annotation within pipelines. As a result, these approaches are difficult to implement and hard to generalize across various scenarios. We propose a backpropagation-based solution that formulates facial mesh tracking as a differentiable optimization problem called the BPMT. Our solution leverages visual clues extracted from the stereo input to estimate vertex-wise geometry and texture information. The BPMT is composed of two steps: automatic face analysis and mesh tracking. In the first step, a range of visual clues are automatically extracted from the input, including facial point clouds, multi-view 2D landmarks, 3D landmarks in the world coordinate system, motion fields, and image masks. The second step can be viewed as a differentiable optimization problem, with constraints comprising stereo video input and facial clues. The primary objective is to achieve topologically consistent 3D facial meshes across frames. Additionally, the parameters to be optimized encompass the positions of free-form deformed vertices and a shared texture UV map. Furthermore, the 3D morphable model (3DMM) is introduced as a form of regularization to enhance the convergence of the optimization process. Leveraging the fully developed backpropagation software, we progressively register the facial meshes to the recorded object, generating high-quality 3D faces with consistent topologies. The BPMT requires no manual labeling within the pipeline, making it suitable for producing large-scale stereo facial data. Moreover, our method exhibits a high degree of flexibility and extensibility, positioning it as a promising platform for future research in the community.

Read full abstract

Our goal is to efficiently learn personalized animatable 3D head avatars from videos that are geometrically accurate, realistic, relightable, and compatible with current rendering systems. While 3D meshes enable efficient processing and are highly portable, they lack realism in terms of shape and appearance. Neural representations, on the other hand, are realistic but lack compatibility and are slow to train and render. Our key insight is that it is possible to efficiently learn high-fidelity 3D mesh representations via differentiable rendering by exploiting highly-optimized methods from traditional computer graphics and approximating some of the components with neural networks. To that end, we introduce FLARE, a technique that enables the creation of animatable and relightable mesh avatars from a single monocular video. First, we learn a canonical geometry using a mesh representation, enabling efficient differentiable rasterization and straightforward animation via learned blendshapes and linear blend skinning weights. Second, we follow physically-based rendering and factor observed colors into intrinsic albedo, roughness, and a neural representation of the illumination, allowing the learned avatars to be relit in novel scenes. Since our input videos are captured on a single device with a narrow field of view, modeling the surrounding environment light is non-trivial. Based on the split-sum approximation for modeling specular reflections, we address this by approximating the prefiltered environment map with a multi-layer perceptron (MLP) modulated by the surface roughness, eliminating the need to explicitly model the light. We demonstrate that our mesh-based avatar formulation, combined with learned deformation, material, and lighting MLPs, produces avatars with high-quality geometry and appearance, while also being efficient to train and render compared to existing approaches.

Read full abstract

Avatar Creation Research Articles

Articles published on Avatar Creation

Building Consistent Characters through Open-Source Generative AI

Global point cloud registration network for large transformations

Mesh representation matters: investigating the influence of different mesh features on perceptual and spatial fidelity of deep 3D morphable models

3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation.

Does Sense of Presence Affect People's Politeness? Comparing VRChat and Soul App

Formulating facial mesh tracking as a differentiable optimization problem: a backpropagation-based solution

Creating a metaverse‐me: Exploring the consumer avatar creation process

Customized Avatars In A Multiplatform Game On Mobile And Virtual Reality For Hospitalized Children In Hemato-Oncology: A Conceptual Design

Adolescent Female Users' Avatar Creation in Social Virtual Worlds: Opportunities and Challenges.

Storying My Body in Bits and Bytes

3D Reconstruction and Semantic Modeling of Eyelashes

Relightable and Animatable Neural Avatars from Videos

AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose

Avatar creation in the metaverse: A focus on event expectations

TEACHING DIGITAL IDENTITY: OPPORTUNITIES, CHALLENGES, AND ETHICAL CONSIDERATIONS FOR AVATAR CREATION IN EDUCATIONAL SETTINGS

FLARE: Fast Learning of Animatable and Relightable Mesh Avatars

EmoStory: Emotion Prediction and Mapping in Narrative Stories

"To Be or Not to Be Me?": Exploration of Self-Similar Effects ofAvatars on Social Virtual Reality Experiences.

4DHumanOutfit: A multi-subject 4D dataset of human motion sequences in varying outfits exhibiting large displacements

A user experience perspective on heritage tourism in the metaverse: Empirical evidence and design dilemmas for VR

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Avatar Creation Research Articles

Articles published on Avatar Creation

Building Consistent Characters through Open-Source Generative AI

Global point cloud registration network for large transformations

Mesh representation matters: investigating the influence of different mesh features on perceptual and spatial fidelity of deep 3D morphable models

3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation.

Does Sense of Presence Affect People's Politeness? Comparing VRChat and Soul App

Formulating facial mesh tracking as a differentiable optimization problem: a backpropagation-based solution

Creating a metaverse‐me: Exploring the consumer avatar creation process

Customized Avatars In A Multiplatform Game On Mobile And Virtual Reality For Hospitalized Children In Hemato-Oncology: A Conceptual Design

Adolescent Female Users' Avatar Creation in Social Virtual Worlds: Opportunities and Challenges.

Storying My Body in Bits and Bytes

3D Reconstruction and Semantic Modeling of Eyelashes

Relightable and Animatable Neural Avatars from Videos

AvatarVerse: High-Quality &amp; Stable 3D Avatar Creation from Text and Pose

Avatar creation in the metaverse: A focus on event expectations

TEACHING DIGITAL IDENTITY: OPPORTUNITIES, CHALLENGES, AND ETHICAL CONSIDERATIONS FOR AVATAR CREATION IN EDUCATIONAL SETTINGS

FLARE: Fast Learning of Animatable and Relightable Mesh Avatars

EmoStory: Emotion Prediction and Mapping in Narrative Stories

"To Be or Not to Be Me?": Exploration of Self-Similar Effects ofAvatars on Social Virtual Reality Experiences.

4DHumanOutfit: A multi-subject 4D dataset of human motion sequences in varying outfits exhibiting large displacements

A user experience perspective on heritage tourism in the metaverse: Empirical evidence and design dilemmas for VR

AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose