Cross-power Spectrum Phase Research Articles

AbstractThis paper describes one digital watermarking method for audio signals using the MPEG psychoacoustic model. It is important that a digital watermark be perceptually inaudible to the human auditory system. One watermark algorithm that takes this into account is the digital watermark algorithm using the MPEG psychoacoustic model proposed by Boney and colleagues. However, this method has several drawbacks such as it is susceptible to MPEG encoding and there is no synchronized detection measure. As a result, several improvements were made to increase the robustness of the digital watermark to MPEG encoding, and information from psychoacoustic tests was also used to introduce successive masking. These improvements enabled a watermark to be implemented that is even robust to MPEG encoding. To ensure robustness to D‐A/A‐D conversions or cropping attacks, a whitened cross‐correlation method (cross‐power spectrum phase) was used to implement synchronized detection. This enabled a watermark that is robust to D‐A/A‐D conversions to be implemented. This watermark was also shown to be robust to attacks such as noise addition, filtering, and downsampling. In addition, this paper shows the results of subjective evaluation experiments, which indicate that there is little quality deterioration due to watermarking. © 2003 Wiley Periodicals, Inc. Electron Comm Jpn Pt 3, 86(12): 65–75, 2003; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ecjc.10143

Read full abstract

It is very important to capture distant-talking speech for a hands-free speech interface with high quality. A microphone array is an ideal candidate for this purpose. However, this approach requires localizing the target talker. Conventional talker localization algorithms in multiple sound source environments not only have difficulty localizing the multiple sound sources accurately, but also have difficulty localizing the target talker among known multiple sound source positions. To cope with these problems, we propose a new talker localization algorithm consisting of two algorithms. One is DOA (direction of arrival) estimation algorithm for multiple sound source localization based on CSP (cross-power spectrum phase) coefficient addition method. The other is statistical sound source identification algorithm based on GMM (Gaussian mixture model) for localizing the target talker position among localized multiple sound sources. In this paper, we particularly focus on the talker localization performance based on the combination of these two algorithms with a microphone array. We conducted evaluation experiments in real noisy reverberant environments. As a result, we confirmed that multiple sound signals can be identified accurately between ‘‘speech’’ or ‘‘non-speech’’ by the proposed algorithm. [Work supported by ATR, and MEXT of Japan.]

Read full abstract

Cross-power Spectrum Phase Research Articles

Related Topics

Articles published on Cross-power Spectrum Phase

Robust Talker Direction Estimation Based on Weighted CSP Analysis and Maximum Likelihood Estimation

Digital watermarks for audio signal based on psychoacoustic masking model

An evaluation of talker localization based on direction of arrival estimation and statistical sound source identification

Use of the crosspower-spectrum phase in acoustic event location

Estimation of wavefront arrival delay for acoustical signals using the cross-power spectrum phase technique

A DSP implementation of source location using microphone arrays.

Analysis of in-core dynamics in pressurized water reactors with application to parameter monitoring

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Cross-power Spectrum Phase Research Articles

Related Topics

Articles published on Cross-power Spectrum Phase

Robust Talker Direction Estimation Based on Weighted CSP Analysis and Maximum Likelihood Estimation

Digital watermarks for audio signal based on psychoacoustic masking model

An evaluation of talker localization based on direction of arrival estimation and statistical sound source identification

Use of the crosspower-spectrum phase in acoustic event location

Estimation of wavefront arrival delay for acoustical signals using the cross-power spectrum phase technique

A DSP implementation of source location using microphone arrays.

Analysis of in-core dynamics in pressurized water reactors with application to parameter monitoring