An embedded audio–visual tracking and speech purification system on a dual-core processor platform

Jwu-Sheng Hu, Ming-Tang Lee, Chia-Hsing Yang

doi:10.1016/j.micpro.2010.05.004

Abstract

This paper describes the design of an embedded audio–visual tracking and speech purification system. The system performs human face tracking, voice activity detection, sound source direction estimation, and speech enhancement in real time. Estimating the sound source direction helps to initialize the human face tracking module when the target changes direction. The implementation is based on an embedded dual-core processor platform, the Texas Instruments DM6446 (DaVinci), which contains an ARM core and a DSP core. For speech signal processing, an eight-channel digital microphone array was developed, and the associated pre-processing and interfacing features were designed using an Altera Cyclone II FPGA. All experiments were conducted in a real environment, and the results show that the system can execute all of its audition and vision functions in real time.

Journal: Microprocessors and Microsystems
Publication Date: Jun 4, 2010
Citations: 21

Similar Papers

Localization of Sound Source Direction Using the Binaural Model
Zhong Zhang ... Takashi Imamura
TRANSACTIONS OF THE JAPAN SOCIETY OF MECHANICAL ENGINEERS Series C | VOL. 74
01 Jan 2008

Active Microphone with Parabolic Reflection Board for Estimation of Sound Source Direction
Tetsuya Takiguchi ... Yasuo Ariki
01 May 2008

Estimation of Sound Source Direction of Arrival Map Using Convolutional Neural Network and Cross-Correlation in Frequency Bands
Saulius Sakavicius ... Arturas Serackis
01 Apr 2019

Speech Enhancement Aided End-To-End Multi-Task Learning for Voice Activity Detection
Xu Tan ... Xiao-Lei Zhang
06 Jun 2021
