Abstract

The encryption process for secure voice communication may degrade the speech quality when it is applied to the speech signals before encoding them through a conventional communication system such as GSM or radio trunking. This is because the encryption process usually includes a randomization of the speech signals, and hence, when the speech is decrypted, it may perceptibly be distorted, so satisfactory speech quality for communication is not achieved. To deal with this, we could apply a speech enhancement method to improve the quality of decrypted speech. However, many speech enhancement methods work by assuming noise is present all the time, so the voice activity detector (VAD) is applied to detect the non-speech period to update the noise estimate. Unfortunately, this assumption is not valid for the decrypted speech. Since the encryption process is applied only when speech is detected, distortions from the secure communication system are characteristically different. They exist when speech is present. Therefore, a noise estimator that is able to update noise even when speech is present is needed. However, most noise estimator techniques only adapt to slow changes of noise to avoid over-estimation of noise, making them unsuitable for this task. In this paper, we propose a speech enhancement technique to improve the quality of speech from secure communication. We use a combination of the Wiener filter and spectral subtraction for the noise estimator, so our method is better at tracking fast changes of noise without over-estimating them. Our experimental results on various communication channels indicate that our method is better than other popular noise estimators and speech enhancement methods.

Highlights

  • Secure voice communication is necessary for many applications

  • We evaluated our method using the perceptual evaluation of speech quality (PESQ) [48] and frequency-weighted segmental signal-to-noise ratio (SNR) (FwSNR) [49]

  • We compare our method with several speech enhancement methods and noise estimators

Read more

Summary

Introduction

Transmitted speech may have sensitive and personal information of the speakers that needs to be kept confidential. For military purposes, this becomes even more important since the information may affect national security. The Global System for Mobile (GSM) technologies is mainly used for voice communications in many countries, even for military purposes. While they ensure confidentiality for the radio access channel, its security is not guaranteed when speech is transmitted across the switch network using standard modulation such as pulse-code modulation (PCM) and adaptive differential pulse-code modulation (ADPCM) forms [1]. The decrypted speech may be distorted since the encryption process usually includes a randomization of the speech signals, and as a result, the speech signals may fail to reach sufficient quality

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call