Abstract

Audio splicing is one of the most common manipulation techniques in the audio forensic world. In this paper, the magnitudes of acoustic channel impulse response and ambient noise are considered as the environmental signature and used to authenticate the integrity of query audio and identify the spliced audio segments. The proposed scheme firstly extracts the magnitudes of channel impulse response and ambient noise by applying the spectrum classification technique to each suspected frame. Then, correlation between the magnitudes of query frame and reference frame is calculated. An optimal threshold determined according to the statistical distribution of similarities is used to identify the spliced frames. Furthermore, a refining step using the relationship between adjacent frames is adopted to reduce the false positive rate and false negative rate. Effectiveness of the proposed method is tested on two data sets consisting of speech recordings of human speakers. Performance of the proposed method is evaluated for various experimental settings. Experimental results show that the proposed method not only detects the presence of spliced frames, but also localizes the forgery segments. Comparison results with previous work illustrate the superiority of the proposed scheme.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call