Abstract

In recent times, Face Anti-spoofing (FAS) has gained significant attention in both academic and industrial domains. Although various convolutional neural network (CNN)-based solutions have emerged, multimodal approaches incorporating RGB, depth, and infrared (IR) data have exhibited better performance than unimodal classifiers. The increasing realism of modern presentation attack instruments creates a persistent need to improve the performance of such models. Recently, self-attention-based vision transformers (ViT) have become a popular choice in this field, yet their fundamental aspects for multimodal FAS have not been thoroughly explored. Therefore, we propose a novel framework for FAS called MF²ShrT, which is based on a pretrained vision transformer. The proposed framework uses overlapping patches and parameter sharing in the ViT network, allowing it to utilize multiple modalities in a computationally efficient manner. Furthermore, to effectively fuse intermediate features from the encoders of each ViT, we explore a T-encoder-based hybrid feature block that enables the system to identify correlations and dependencies across modalities. MF²ShrT outperforms conventional vision transformers and achieves state-of-the-art performance on the CASIA-SURF and WMCA benchmarks, demonstrating the efficiency of transformer-based models for presentation attack detection (PAD).
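For illustration, the sketch below shows one way the described architecture could be organized in PyTorch: three modality streams (RGB, depth, IR) pass through overlapping patch embeddings and a single parameter-shared ViT encoder, and their intermediate token features are fused by a small transformer-encoder ("T-encoder") block before binary live/spoof classification. This is a minimal sketch under stated assumptions, not the authors' implementation; all module names, dimensions, and layer choices are assumed for illustration only.

```python
# Minimal sketch (NOT the authors' code) of a shared-parameter multimodal ViT with
# overlapping patch embeddings and a transformer-encoder fusion block.
# Dimensions, depths, and 3-channel inputs for all modalities are assumptions.
import torch
import torch.nn as nn


class OverlapPatchEmbed(nn.Module):
    """Overlapping patch embedding: stride smaller than the patch size."""
    def __init__(self, in_ch=3, embed_dim=256, patch=16, stride=12):
        super().__init__()
        self.proj = nn.Conv2d(in_ch, embed_dim, kernel_size=patch,
                              stride=stride, padding=patch // 2)

    def forward(self, x):                       # x: (B, C, H, W)
        x = self.proj(x)                        # (B, D, H', W')
        return x.flatten(2).transpose(1, 2)     # (B, N, D) token sequence


class MultimodalSharedViTSketch(nn.Module):
    def __init__(self, embed_dim=256, depth=6, heads=8, num_classes=2):
        super().__init__()
        # One patch embed per modality, but a single *shared* transformer encoder,
        # which keeps the multimodal model computationally efficient.
        self.embeds = nn.ModuleDict({m: OverlapPatchEmbed(embed_dim=embed_dim)
                                     for m in ("rgb", "depth", "ir")})
        layer = nn.TransformerEncoderLayer(embed_dim, heads,
                                           dim_feedforward=4 * embed_dim,
                                           batch_first=True)
        self.shared_encoder = nn.TransformerEncoder(layer, num_layers=depth)
        # Hybrid feature block: a small transformer encoder over the concatenated
        # per-modality tokens, so attention can model cross-modal correlations.
        self.fusion_encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(embed_dim, heads,
                                       dim_feedforward=4 * embed_dim,
                                       batch_first=True),
            num_layers=2)
        self.head = nn.Linear(embed_dim, num_classes)

    def forward(self, rgb, depth, ir):
        feats = []
        for name, x in (("rgb", rgb), ("depth", depth), ("ir", ir)):
            tokens = self.embeds[name](x)
            feats.append(self.shared_encoder(tokens))   # shared weights across modalities
        fused = self.fusion_encoder(torch.cat(feats, dim=1))
        return self.head(fused.mean(dim=1))             # live vs. spoof logits


if __name__ == "__main__":
    model = MultimodalSharedViTSketch()
    rgb = depth = ir = torch.randn(2, 3, 224, 224)
    print(model(rgb, depth, ir).shape)   # torch.Size([2, 2])
```

In practice, single-channel depth and IR maps would be replicated to three channels (or given their own input convolutions), and the shared encoder would be initialized from a pretrained ViT checkpoint rather than trained from scratch.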
