Abstract

This research focuses on the application of joint time-frequency (TF) analysis for watermarking and classifying different audio signals. Time frequency analysis which originated in the 1930s has often been used to model the non-stationary behaviour of speech and audio signals. By taking into consideration the human auditory system which has many non-linear effects and its masking properties, we can extract efficient features from the TF domain to watermark or classify signals. This novel audio watermarking scheme is based on spread spectrum techniques and uses content-based analysis to detect the instananeous mean frequency (IMF) of the input signal. The watermark is embedded in this perceptually significant region such that it will resist attacks. Audio watermarking offers a solution to data privacy and helps to protect the rights of the artists and copyright holders. Using the IMF, we aim to keep the watermark imperceptible while maximizing its robustness. In this case, 25 bits are embedded and recovered witin a 5 s sample of an audio signal. This scheme has shown to be robust against various signal processing attacks including filtering, MP3 compression, additive moise and resampling with a bit error rate in the range of 0-13%. In addition content-based classification is performed using TF analysis to classify sounds into 6 music groups consisting of rock, classical, folk, jazz and pop. The features that are extracted include entropy, centroid, centroid ratio, bandwidth, silence ratio, energy ratio, frequency location of minimum and maximum energy. Using a database of 143 signals, a set of 10 time-frequncy features are extracted and an accuracy of classification of around 93.0% using regular linear discriminant analysis or 92.3% using leave one out method is achieved.

Highlights

  • J OINT time-frequency (TF) analysis of signals such as radar, sonar, communica­ tions and biomedical signals is necessary to understand and analyze their true non-stationary behaviour

  • We examine the application of the spectrogram which was previously used primarily to give a visual representation of the signal, to extract numbers and features that will be applied to the watermarking and retrieval techniques

  • In this Chapter, we proposed a novel spread spectrum watermarking technique that utilizes instantaneous mean frequency (IMF) estimation of the original audio signal and the simultaneous masking property to determine optimal points of insertion of the watermark

Read more

Summary

W igner-Ville distribution of two tones (tones are clearly visible at 90

3.8 C alculating IE and IMF of a ROCK music signal “acorg.wav” : 4.3 Comparison of entropy values a) Results for different genres b) Distri­. 4.7 All-groups scatter plot with the first two canonical discriminant functions 72 4.8 Territorial Map- Symbols used in territorial map: Symbol, Group, La­

Introduction
O rganization of th e T hesis
Short-T im e Fourier Transform
W igner-Ville Distribution
Applications of TF analysis and STFT
C hapter Sum m ary
A pplications
Related Work
M otivation
B ackground and M ethodology
Introduction to Spread Spectrum System s
Spread Spectrum Characteristics
Spread Spectrum Techniques
Instantaneous mean frequency estimation
Watermarking algorithm
Sim ulation R esults
Conclusions
R elated W ork
Audio Feature Extraction
Entropy
E nergy ratio
Brightness
B andw idth
Silence ratio
Summary of Features
Linear Discriminant Analysis
C lassification R esults
Spread spectrum watermarking and instantaneous mean frequency
Content based audio classification
Findings
Future work
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call