Speech extraction based on ICA and audio-visual coherence

D Sodoyer, J.-L Schwartz, C Jutten, L Girin

doi:10.1109/isspa.2003.1224816

Publication Date: Jan 1, 2003

Citations: 5

Affiliation: Stendhal University, Grenoble Images Parole Signal Automatique, Joseph Fourier University, Laboratoire d’Informatique et Systèmes, French National Centre for Scientific Research, Université Grenoble Alpes

#Acoustic Signal #Speaker's Lip Movements

Abstract

We present a new approach to the source separation problem for multiple speech signals. Using the extra visual information of the speaker's face, the method aims to extract an acoustic speech signal from other acoustic signals by exploiting its coherence with the speaker's lip movements. We define a statistical model of the joint probability of visual and spectral audio input for quantifying the audio-visual coherence. Then, separation can be achieved by maximising this joint probability. Experiments on additive mixtures of 2, 3 and 5 sources show that the algorithm performs well, and systematically better than the classical BSS algorithm JADE.
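
The idea of choosing the demixing that makes the extracted audio most coherent with the speaker's lips can be illustrated with a toy sketch. This is not the authors' statistical model: the abstract does not give its form, so the sketch swaps in a much simpler stand-in score, the Pearson correlation between the frame log-energy of the extracted signal and the log of a lip-opening track. Everything here is an assumption made for illustration: the synthetic sources (noise modulated by sinusoidal "lip" envelopes), the mixing matrix `A`, the frame length, the restriction to 2 mixtures, the brute-force grid search, and the helper names `frame_log_energy`, `av_coherence` and `extract_by_coherence`.

```python
import numpy as np

def frame_log_energy(y, frame_len):
    """Log mean-squared amplitude over consecutive non-overlapping frames."""
    n_frames = len(y) // frame_len
    frames = y[: n_frames * frame_len].reshape(n_frames, frame_len)
    return np.log(np.mean(frames ** 2, axis=1) + 1e-12)

def av_coherence(y, lip_opening, frame_len):
    """Stand-in coherence score (assumption, not the paper's model):
    correlation between audio log-energy and log lip opening per frame."""
    audio = frame_log_energy(y, frame_len)
    video = np.log(lip_opening)
    return np.corrcoef(audio, video)[0, 1]

def extract_by_coherence(x, lip_opening, frame_len, n_angles=720):
    """For 2 mixtures, grid-search a unit-norm demixing vector b so that
    y = b @ x maximises the stand-in audio-visual coherence score."""
    best_score, best_b = -np.inf, None
    for theta in np.linspace(0.0, np.pi, n_angles, endpoint=False):
        b = np.array([np.cos(theta), np.sin(theta)])
        score = av_coherence(b @ x, lip_opening, frame_len)
        if score > best_score:
            best_score, best_b = score, b
    return best_b, best_score

# Synthetic data (hypothetical): two noise sources whose amplitude
# follows a per-frame "lip opening" envelope; only lip1 is observed.
rng = np.random.default_rng(0)
n_frames, frame_len = 400, 160
k = np.arange(n_frames)
lip1 = 1.0 + 0.9 * np.sin(2 * np.pi * 7 * k / n_frames)         # filmed speaker
lip2 = 1.0 + 0.9 * np.sin(2 * np.pi * 11 * k / n_frames + 1.0)  # unseen speaker
n_samples = n_frames * frame_len
s1 = np.repeat(lip1, frame_len) * rng.standard_normal(n_samples)
s2 = np.repeat(lip2, frame_len) * rng.standard_normal(n_samples)

# Additive (instantaneous) mixture of the two sources.
A = np.array([[1.0, 0.6],
              [0.5, 1.0]])
x = A @ np.vstack([s1, s2])

b, score = extract_by_coherence(x, lip1, frame_len)
y = b @ x  # extracted signal, defined up to sign and scale

corr_extracted = abs(np.corrcoef(y, s1)[0, 1])
corr_mixture = abs(np.corrcoef(x[0], s1)[0, 1])
print("coherence score:", score)
print("|corr| with target, extracted vs. first mixture:",
      corr_extracted, corr_mixture)
```

Only the video of the target speaker is used to pick `b`; the clean source `s1` enters only at the end, to check the result. A grid search is practical here because a unit-norm demixing vector for 2 mixtures has a single free angle. For more sources or a richer audio-visual model, a numerical optimiser over the demixing parameters would be needed instead.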
