Abstract

This article aims to comprehensively investigate the theoretical and practical foundations, as well as the distinctive characteristics, underpinning the study of multimodal identification algorithms. This investigation will be conducted using state-of-the-art methods and tools of multidimensional analysis. The development of a multimodal algorithm using the method of modality fusion at the feature level encompasses the integration of various algorithms rooted in multivariate analysis. These include a combined voice activity detector, a face detector utilizing the MTCNN (multi-task cascade convolutional networks) architecture, fine-frequency cepstral coefficients, facial image features, and a decision-making module. To construct a multimodal identification algorithm, a framework for combining these algorithms based on multivariate analysis is proposed. Analysis of the acquired data indicates that “Test 1”, utilizing facial image data, exhibits the highest performance indicators, approaching nearly 100%. Tests 2 and 3 involving voice signals exhibit a minor error in the pre-processing stage, attributed to the inherent delay experienced by participants during the video conference. The proposed multimodal algorithm, integrated within a biometric identification system, enables successful user verification research through the utilization of a combined multidimensional analysis algorithm. Furthermore, the algorithm showcases superior research outcomes in comparison to other analogous multimodal identification algorithms, as it yields precise results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.