AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis

Dongze Li,Yingya Zhang,Kang Zhao,Wei Wang,Bo Peng,Jing Dong,Tieniu Tan

doi:10.1609/aaai.v38i4.28086

Abstract

Audio-driven talking head synthesis is a promising topic with wide applications in digital human, film making and virtual reality. Recent NeRF-based approaches have shown superiority in quality and fidelity compared to previous studies. However, when it comes to few-shot talking head generation, a practical scenario where only few seconds of talking video is available for one identity, two limitations emerge: 1) they either have no base model, which serves as a facial prior for fast convergence, or ignore the importance of audio when building the prior; 2) most of them overlook the degree of correlation between different face regions and audio, e.g., mouth is audio related, while ear is audio independent. In this paper, we present Audio Enhanced Neural Radiance Field (AE-NeRF) to tackle the above issues, which can generate realistic portraits of a new speaker with few-shot dataset. Specifically, we introduce an Audio Aware Aggregation module into the feature fusion stage of the reference scheme, where the weight is determined by the similarity of audio between reference and target image. Then, an Audio-Aligned Face Generation strategy is proposed to model the audio related and audio independent regions respectively, with a dual-NeRF framework. Extensive experiments have shown AE-NeRF surpasses the state-of-the-art on image fidelity, audio-lip synchronization, and generalization ability, even in limited training set or training iterations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Mar 24, 2024
Citations: 1

Similar Papers

EVALUATION OF VIRTUAL REALITY AND AUGMENTED REALITY FOR TEACHING THE LESSON OF GEOMETRIC SOLIDS TO PRIMARY SCHOOL CHILDREN
Eleni Demitriadou ... Andreas Lanitis
-
Eleni Demitriadou, et. al.Eleni Demitriadou ... Andreas Lanitis
01 Jul 2019
01 Jul 2019

A review of the application of virtual and augmented reality in physical and occupational therapy
Agrawal Luckykumar Dwarkadas ... Viswanath Talasila
Software: Practice and Experience | VOL. -
Agrawal Luckykumar Dwarkadas, et. al.Agrawal Luckykumar Dwarkadas ... Viswanath Talasila
02 Mar 2024
Software: Practice and Experience | VOL. -

Extended-Reality Technologies: An Overview of Emerging Applications in Medical Education and Clinical Care.
Wilfredo López-Ojeda ... Robin A Hurley
The Journal of neuropsychiatry and clinical neurosciences | VOL. 33
Wilfredo López-Ojeda, et. al.Wilfredo López-Ojeda ... Robin A Hurley
01 Jul 2021
The Journal of neuropsychiatry and clinical neurosciences | VOL. 33

Improvement of User Performance in Virtual Reality by Boosting Sense of Agency
Andrii V Lysenkko
Microsystems, Electronics and Acoustics | VOL. 24
Andrii V LysenkkoAndrii V Lysenkko
28 Jun 2019
Microsystems, Electronics and Acoustics | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence