Performance Analysis of Feature Mel Frequency Cepstral Coefficient and Short Time Fourier Transform Input for Lie Detection using Convolutional Neural Network

Dewi Kusumawati,Amil Ahmad Ilham,Ingrid Nurtanio,Andani Achmad

doi:10.62527/joiv.8.1.2062

Dewi Kusumawati, Amil Ahmad Ilham + Show 2 more

Open Access

https://doi.org/10.62527/joiv.8.1.2062

Copy DOI

Abstract

This study aims to determine which model is more effective in detecting lies between models with Mel Frequency Cepstral Coefficient (MFCC) and Short Time Fourier Transform (STFT) processes using Convolutional Neural Network (CNN). MFCC and STFT processes are based on digital voice data from video recordings that have been given lie or truth information regarding certain situations. Data is then pre-processed and trained on CNN. The results of model performance evaluation with hyper-tuning parameters and random search implementation show that using MFCC as Voice data processing provides better performance with higher accuracy than using the STFT process. The best parameters from MFCC are obtained with filter convolutional=64, kerneconvolutional1=5, filterconvolutional2=112, kernel convolutional2=3, filter convolutional3=32, kernelconvolutional3 =5, dense1=96, optimizer=RMSProp, learning rate=0.001 which achieves an accuracy of 97.13%, with an AUC value of 0.97. Using the STFT, the best parameters are obtained with filter convolutional1=96, kernel convolutional1=5, convolutional2 filters=48, convolutional2 kernels=5, convolutional3 filters=96, convolutional3 kernels=5, dense1=128, Optimizer=Adaddelta, learning rate=0.001, which achieves an accuracy of 95.39% with an AUC value of 0.95. Prosodics are used to compare the performance of MFCC and STFT. The result is that prosodic has a low accuracy of 68%. The analysis shows that using MFCC as the process of sound extraction with the CNN model produces the best performance for cases of lie detection using audio. It can be optimized for further research by combining CNN architectural models such as ResNet, AlexNet, and other architectures to obtain new models and improve lie detection accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance Analysis of Feature Mel Frequency Cepstral Coefficient and Short Time Fourier Transform Input for Lie Detection using Convolutional Neural Network

Abstract

Talk to us

Similar Papers

More From: JOIV : International Journal on Informatics Visualization

Lead the way for us

Journal: JOIV : International Journal on Informatics Visualization	Publication Date: Mar 31, 2024
License type: CC BY-SA 4.0

Similar Papers

Heart Murmur Classification in Phonocardiogram Representations Using Convolutional Neural Networks
Mehlam Shabbir ... Mona Nasseri
The International FLAIRS Conference Proceedings | VOL. 36
Mehlam Shabbir, et. al.Mehlam Shabbir ... Mona Nasseri
08 May 2023
The International FLAIRS Conference Proceedings | VOL. 36

CNN based approach for Speech Emotion Recognition Using MFCC, Croma and STFT Hand-crafted features
Nagendra Kumar ... Ratndeep Kaushal
-
Nagendra Kumar, et. al.Nagendra Kumar ... Ratndeep Kaushal
17 Dec 2021
17 Dec 2021

Comparative Analysis of Audio Processing Techniques on Doppler Radar Signature of Human Walking Motion Using CNN Models.
Minh-Khue Ha ... Nguyen Van Hieu
Sensors (Basel, Switzerland) | VOL. 23
Minh-Khue Ha, et. al.Minh-Khue Ha ... Nguyen Van Hieu
26 Oct 2023
Sensors (Basel, Switzerland) | VOL. 23

A Tiny CNN Architecture for Identifying Bat Species from Echolocation Calls
Imran Zualkernan ... Priyanka Chand
-
Imran Zualkernan, et. al.Imran Zualkernan ... Priyanka Chand
21 Sep 2020
21 Sep 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance Analysis of Feature Mel Frequency Cepstral Coefficient and Short Time Fourier Transform Input for Lie Detection using Convolutional Neural Network

Abstract

Talk to us

Similar Papers

More From: JOIV : International Journal on Informatics Visualization