Steganalysis is an important and challenging problem in multimedia forensics. Many deep networks have been proposed to improve the detection of steganographic traces, and existing methods focus on ever-deeper structures. However, as a model deepens, gradient backpropagation cannot be guaranteed to flow through the weights of every module, which makes the network difficult to train; in addition, deeper structures consume more GPU computing resources. To reduce computation and accelerate training, we propose a novel architecture that combines batch normalization with shallow layers. To reduce the loss of the subtle information that steganalysis depends on, we decrease the depth and increase the width of the network and abandon max-pooling layers. To shorten the lengthy training process under different payloads, we propose two transfer learning schemes, parameter multiplexing and fine-tuning, which improve overall efficiency. We demonstrate the effectiveness of our method on two steganographic algorithms, WOW and S-UNIWARD. Compared with SRM and Ye.net, our model achieves better detection performance on the BOSSbase database while improving efficiency.
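
The batch normalization used in the proposed shallow architecture can be illustrated with a minimal NumPy sketch. This is not the paper's implementation, only the standard per-feature normalize-scale-shift operation that the abstract refers to; the array shapes and parameter names are illustrative assumptions.

```python
import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    """Standard batch normalization over the batch axis.

    x:     (batch, features) activations
    gamma: (features,) learned scale
    beta:  (features,) learned shift
    """
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mean) / np.sqrt(var + eps)  # zero-mean, unit-variance per feature
    return gamma * x_hat + beta

# Illustrative usage: activations with a nonzero mean and large variance
# are rescaled to a well-conditioned distribution, which is what lets a
# shallow network train quickly without very careful initialization.
rng = np.random.default_rng(0)
x = rng.normal(loc=3.0, scale=2.0, size=(64, 8))
y = batch_norm(x, gamma=np.ones(8), beta=np.zeros(8))
```

After normalization, each feature column of `y` has (near-)zero mean and unit variance, regardless of the input statistics.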
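
The parameter-multiplexing transfer scheme can be sketched abstractly: weights trained under one payload initialize the model for another payload, after which only part of the network is fine-tuned. The layer names and dictionary representation below are hypothetical placeholders, not the paper's actual network.

```python
def multiplex_parameters(source_params, target_params, shared_layers):
    """Copy the weights of the named layers from a source model
    (e.g. trained at a high payload) into a target model that will
    be fine-tuned at a different payload; all other target layers
    keep their own initialization.
    """
    for name in shared_layers:
        target_params[name] = list(source_params[name])  # copy, don't alias
    return target_params

# Hypothetical weight dictionaries standing in for network state.
source = {"conv1": [0.1, 0.2], "conv2": [0.3], "fc": [0.9]}
target = {"conv1": [0.0, 0.0], "conv2": [0.0], "fc": [0.0]}
updated = multiplex_parameters(source, target, shared_layers=["conv1", "conv2"])
```

Here the convolutional layers are reused across payloads while the classifier layer (`fc`) is left to be retrained, which is the usual reason such a scheme shortens training.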