Audio-Visual Kinship Verification: A New Dataset and a Unified Adaptive Adversarial Multimodal Learning Approach.

Xiaoting Wu, Xiaoyi Feng, Xueyi Zhang, Miguel Bordallo López, Li Liu

doi:10.1109/tcyb.2022.3220040

Abstract

Facial kinship verification is the task of automatically determining from their faces whether two people are kin. It has become a popular research topic because of its potential practical applications. Over the past decade, most efforts have aimed to improve verification performance using human faces only, without other biometric information such as the speaking voice. In this article, to interpret and benefit from multiple modalities, we propose for the first time to combine human faces and voices to verify kinship, which we refer to as audio-visual kinship verification. We first establish a comprehensive audio-visual kinship dataset, called TALKIN-Family, that consists of videos of family members' talking faces recorded under various scenarios. Based on this dataset, we present an extensive evaluation of kinship verification from faces and voices. In particular, we propose a deep-learning-based fusion method called unified adaptive adversarial multimodal learning (UAAML), which consists of an adversarial network and an attention module built on unified multimodal features. Experiments show that audio (voice) information is complementary to facial features and useful for kinship verification. Furthermore, the proposed fusion method outperforms baseline methods. We also evaluate human verification ability on a subset of TALKIN-Family; the results indicate that humans achieve higher accuracy when they have access to both faces and voices, and that the machine-learning methods can outperform human ability effectively and efficiently. Finally, we discuss future work and research opportunities with the TALKIN-Family dataset.
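To make the idea of attention-based audio-visual fusion concrete, the following is a minimal sketch, not the UAAML method itself: the abstract does not specify the architecture, and the adversarial component is omitted here entirely. It assumes each person is already represented by a face embedding and a voice embedding of equal length. The `query` vector, the cosine-similarity score, and the `threshold` value are all hypothetical choices made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def attention_fuse(face_emb, voice_emb, query):
    """Fuse two equal-length embeddings using softmax attention weights.

    `query` is a hypothetical attention vector; in a trained model it
    would be learned rather than supplied by hand.
    """
    face, voice = l2_normalize(face_emb), l2_normalize(voice_emb)
    scores = [
        sum(q * x for q, x in zip(query, face)),
        sum(q * x for q, x in zip(query, voice)),
    ]
    w_face, w_voice = softmax(scores)
    fused = [w_face * f + w_voice * v for f, v in zip(face, voice)]
    return fused, (w_face, w_voice)


def cosine_similarity(a, b):
    """Cosine similarity between two vectors."""
    a, b = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a, b))


def verify_kinship(person_a, person_b, query, threshold=0.5):
    """Score a pair of (face_emb, voice_emb) tuples; threshold is illustrative."""
    fused_a, _ = attention_fuse(person_a[0], person_a[1], query)
    fused_b, _ = attention_fuse(person_b[0], person_b[1], query)
    score = cosine_similarity(fused_a, fused_b)
    return score, score >= threshold


if __name__ == "__main__":
    # Toy embeddings, made up for illustration only.
    parent = ([0.9, 0.1, 0.0], [0.2, 0.8, 0.1])
    child = ([0.8, 0.2, 0.1], [0.3, 0.7, 0.0])
    print(verify_kinship(parent, child, query=[0.5, 0.5, 0.0]))
```

The softmax weights let the fused representation lean on whichever modality scores higher against the query. This is one simple way to express the claim that voice can complement the face rather than replace it.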


Journal: IEEE Transactions on Cybernetics
Publication Date: Mar 1, 2024
Citations: 3
License type: CC BY 4.0


Similar Papers

Prototype-Based Discriminative Feature Learning for Kinship Verification
Haibin Yan ... Xiuzhuang Zhou
IEEE Transactions on Cybernetics | VOL. 45
10 Dec 2014

Feature Fusion and NRML Metric Learning for Facial Kinship Verification
Fahimeh Ramazankhani ... Mahdi Yazdian-Dehkord
JUCS - Journal of Universal Computer Science | VOL. 29
28 Apr 2023

Neighborhood Repulsed Metric Learning for Kinship Verification
Jiwen Lu ... Yuanyuan Shang
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 36
01 Feb 2014

A literature survey on kinship verification through facial images
Xiaoqian Qin ... Dong Wang
Neurocomputing | VOL. 377
18 Oct 2019
