Abstract

In the preceding paper, we proposed a method for auditory scene analysis in which the instantaneous frequency, frequency change rate, and amplitude change rate in time‐frequency space are accumulated into a multipeak probability density distribution by a voting method, and the grouping of mixed sounds into streams is realized. In this paper, as the core of the second half of this method, we introduce the assumption that the stream parameters vary slowly according to known dynamics and propose an integration method along the time axis, in which the probability density distribution of the stream parameters is optimally estimated as a time series by a nonparametric Kalman filter. This realizes mechanisms of higher-level auditory scene analysis, such as improving the accuracy of the stream parameters, interpolating and connecting breaks in the streams, and introducing a priori knowledge into stream selection. Moreover, a separation and reconstruction system for the sounds corresponding to the streams is constructed, and the proposed technique is verified by fundamental experiments on synthesized sounds as well as musical sounds and voices. © 2002 Wiley Periodicals, Inc. Syst Comp Jpn, 33(10): 83–94, 2002; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.1160
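The time-axis integration described in the abstract can be illustrated with a minimal sketch: a grid-based Bayesian filter, one common nonparametric analogue of the Kalman filter. This is not the paper's implementation; the bin count, drift kernel width, and vote likelihoods below are illustrative assumptions. The predict step diffuses the density under the assumed slow-dynamics model, and the update step multiplies it by the frame's vote histogram and renormalizes.

```python
import numpy as np

def gaussian_kernel(width, sigma):
    """Transition kernel modeling slowly drifting stream parameters."""
    x = np.arange(-width, width + 1, dtype=float)
    k = np.exp(-0.5 * (x / sigma) ** 2)
    return k / k.sum()

def grid_filter_step(prior, votes, kernel, eps=1e-12):
    """One predict/update cycle on a discretized parameter axis.

    prior  -- current density over parameter bins (sums to 1)
    votes  -- nonnegative vote histogram from the current frame
    """
    # Predict: convolve with the drift kernel (slow-dynamics assumption).
    predicted = np.convolve(prior, kernel, mode="same")
    predicted /= predicted.sum()
    # Update: weight by the frame's votes (used as a likelihood),
    # then renormalize; eps keeps zero-vote bins from vanishing.
    posterior = predicted * (votes + eps)
    return posterior / posterior.sum()

# Toy demo: a vote peak drifting from bin 30 to bin 34 across frames,
# with a small amount of background clutter added to each frame.
bins = 64
density = np.full(bins, 1.0 / bins)            # flat prior
kernel = gaussian_kernel(width=5, sigma=1.5)
rng = np.random.default_rng(0)
for center in [30, 31, 32, 33, 34]:
    votes = np.exp(-0.5 * ((np.arange(bins) - center) / 2.0) ** 2)
    votes += 0.05 * rng.random(bins)           # clutter votes
    density = grid_filter_step(density, votes, kernel)
print(int(np.argmax(density)))  # peak tracks the drifting parameter
```

Because the posterior persists between frames, a frame with missing or weak votes leaves the predicted density largely intact, which is one way the abstract's "interpolation and connection of breaks in the streams" can be understood.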
