Abstract

The wide use of digital speech recorders becomes a serious matter when they are involved in assisting with court rulings. How to distinguish if a recorded content is valid or not becomes a life-or-death question. In light of this concern, least significant bits (LSB) of excitation signals would be used as fragile watermarks in the hybrid speech vocoder. In addition, a location-variable content-dependent watermark generating mechanism is proposed. Such location-variable content-based watermark would allow users to detect where in the recording the content is being replaced, inserted, or deleted. Lastly, an attempt is done to store partial reconstruction data in the LSBs of excitation signals in the G.723.1 speech codec, so that the original speech content may be reconstructed after counterfeited. The proposed system is demonstrated to be a reliable system, with test results showing that a recording with watermarks has a perceptual evaluation of speech quality (PESQ) value down 0.2, while the accuracy in detecting faked regions can be up to 97.45%.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call