Evaluation of digital watermarking on subjective speech quality

Yann Kowalczuk, Jan Holub

doi:10.1038/s41598-021-99811-x

Abstract

New methods of securing the distribution of audio content have been widely deployed in the last twenty years. Their impact on perceived quality has, however, seldom been the subject of extensive recent research. We review the state of the art in digital speech watermarking and provide subjective testing of watermarked speech samples. The latest speech watermarking techniques are listed, with their specifics and potential for further development. Their current and possible applications are evaluated. Open-source software designed to embed watermarking patterns in audio files is used to produce a set of samples that satisfies the requirements of modern subjective speech-quality assessments. The analysis mainly considers the patchwork algorithm implemented in that application. Different watermark robustness levels are used, which makes it possible to determine the threshold at which human listeners detect the watermark. The subjective listening tests are conducted following ITU-T Recommendation P.800, which precisely defines the conditions and requirements for subjective testing. Further analysis examines the effects of noise and various disturbances on the perceived quality of watermarked speech. A threshold of intelligibility is estimated to open the way for further work on speech compression techniques with watermarking. The impact of language or social background is evaluated through an additional experiment involving two groups of listeners. Results show significant robustness of the watermarking implementation, which retains both a reasonable net subjective audio quality and its security attributes despite mild levels of distortion and noise. Extended experiments with Chinese listeners make it possible to formulate a hypothesis on how perception varies with geographical and social background.
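The abstract names the patchwork algorithm and varying robustness levels without describing them. The following is a minimal sketch of the general patchwork idea only, assuming a simplified time-domain variant on a synthetic signal; it is not the open-source software used in the study, and the names `patch_indices`, `embed`, `detect_statistic`, `key` and `strength` are hypothetical. Here `strength` loosely stands in for a robustness level: a larger value is easier to detect but more likely to be audible.

```python
import random

def patch_indices(n_samples, n_pairs, key):
    """Derive two disjoint pseudo-random index sets from a secret key (hypothetical helper)."""
    rng = random.Random(key)
    chosen = rng.sample(range(n_samples), 2 * n_pairs)
    return chosen[:n_pairs], chosen[n_pairs:]

def embed(signal, key, n_pairs, strength):
    """Raise the samples in set A and lower the samples in set B by `strength`."""
    set_a, set_b = patch_indices(len(signal), n_pairs, key)
    marked = list(signal)
    for i in set_a:
        marked[i] += strength
    for i in set_b:
        marked[i] -= strength
    return marked

def detect_statistic(signal, key, n_pairs):
    """Mean of set A minus mean of set B; near 0 without a mark, shifted by 2*strength with one."""
    set_a, set_b = patch_indices(len(signal), n_pairs, key)
    mean_a = sum(signal[i] for i in set_a) / n_pairs
    mean_b = sum(signal[i] for i in set_b) / n_pairs
    return mean_a - mean_b

# Synthetic stand-in for one second of speech at 8 kHz (assumption, not real data).
noise = random.Random(0)
speech = [noise.gauss(0.0, 0.1) for _ in range(8000)]

marked = embed(speech, key=1234, n_pairs=2000, strength=0.01)
shift = detect_statistic(marked, 1234, 2000) - detect_statistic(speech, 1234, 2000)
```

Only a receiver holding the same `key` can rebuild the two index sets, which is why the statistic can serve as a detection test. Practical audio implementations typically apply the same idea to transform-domain coefficients rather than raw samples.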

Highlights

New methods of securing the distribution of audio content have been widely deployed in the last twenty years
In order to determine these two limits, selected speech samples will be gradually watermarked with increasing robustness, leading to progressive speech quality degradation
The votes are based on a MOS (Mean Opinion Score) scale, as described by ITU-T Recommendation P.800
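
As an illustration of how votes on the five-point P.800 absolute category rating scale (5 = Excellent, 4 = Good, 3 = Fair, 2 = Poor, 1 = Bad) reduce to a MOS, here is a minimal sketch. The function name `mean_opinion_score` and the votes are made up for the example, and the confidence interval uses a normal approximation, which is an assumption rather than the paper's stated procedure.

```python
import math
import statistics

def mean_opinion_score(votes):
    """Return (MOS, half-width of an approximate 95% confidence interval)."""
    if any(v not in (1, 2, 3, 4, 5) for v in votes):
        raise ValueError("votes must be integers from 1 to 5")
    mos = statistics.mean(votes)
    # Normal approximation; with few listeners a Student-t factor would be more appropriate.
    ci95 = 1.96 * statistics.stdev(votes) / math.sqrt(len(votes))
    return mos, ci95

votes = [4, 5, 3, 4, 4, 2, 5, 4]  # made-up votes for one test condition
mos, ci95 = mean_opinion_score(votes)
print(f"MOS = {mos:.3f} +/- {ci95:.3f}")
```

Computing one such score per robustness level would let the degradation described above be tracked as the watermark is strengthened.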

Summary

Introduction

New methods of securing the distribution of audio content have been widely deployed in the last twenty years. Their impact on perceived quality has, however, seldom been the subject of extensive recent research. Results show significant robustness of the watermarking implementation, which retains both a reasonable net subjective audio quality and its security attributes despite mild levels of distortion and noise. Thanks to a simple key exchange process, watermarking may be used to trust non-encrypted transmissions, such as telephone or radio transmission; this principle may include emitter identification, using a dedicated watermarking pattern decoded in the receiver. An extension of this principle to encrypted, compressed audio transmissions is of significant interest in modern cybersecurity. In telecommunication transmission quality tests, ITU-T Recommendation P.800 is widely used. It states the conditions for subjective audio quality assessment: the subjects must be seated in an anechoic or semi-anechoic listening environment, and specific guidelines apply to sample creation and playback.

Scientific Reports (2021) 11:20185

Journal: Scientific Reports
Publication Date: Oct 12, 2021
License type: open-access
