Extract voice information using high-speed camera

Mariko Akutsu,Yasuhiro Oikawa,Yoshio Yamasaki

doi:10.1121/1.4805440

Abstract

Conversation is one of the most important channels for human beings. To help communications, speech recognition technologies have been developed. Above all, in a conversation, not only contents of utterances but also intonations and tones include important information regarding a speaker’s intention. To study the sphere of human speech, microphones are typically used to record voices. However, since microphones have to be set around a space, their existences affect a physical behavior of the sound field. To challenge this problem, we have suggested a recording method using a high-speed camera. By using a high-speed camera for recording sound vibrations, it can record two or more points within the range of the camera at the same time and can record from a distance, without interfering with the sound fields. In this study, we extract voice information using high-speed videos, which capture both a face and a cervical part of the subject. This method allows recording skin vibrations, which contain voices with individuality and extrapolating sound waves by using an image processing method. The result of the experiment shows that a high-speed camera is capable of recording voice information.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Extract voice information using high-speed camera

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Journal: The Journal of the Acoustical Society of America	Publication Date: May 1, 2013
Citations: 3

Similar Papers

Extract voice information using high-speed camera
Mariko Akutsu ... Yoshio Yamasaki
-
Mariko Akutsu, et. al.Mariko Akutsu ... Yoshio Yamasaki
01 Jan 2013
01 Jan 2013

An Analysis of Perspectives for Using High-Speed Cameras in Processing Dynamic Video Information
Denis Viktorovich Ivanko ... Alexey Anatolievich Karpov
SPIIRAS Proceedings | VOL. 1
Denis Viktorovich Ivanko, et. al.Denis Viktorovich Ivanko ... Alexey Anatolievich Karpov
15 Feb 2016
SPIIRAS Proceedings | VOL. 1

Multimodal speech recognition: increasing accuracy using high speed video data
Denis Ivanko ... Wolfgang Minker
Journal on Multimodal User Interfaces | VOL. 12
Denis Ivanko, et. al.Denis Ivanko ... Wolfgang Minker
01 Aug 2018
Journal on Multimodal User Interfaces | VOL. 12

Quantification of Ciliary Beat Frequency in Sinonasal Epithelial Cells Using Differential Interference Contrast Microscopy and High-Speed Digital Video Imaging
Ioana Schipor ... James N Palmer
American Journal of Rhinology | VOL. 20
Ioana Schipor, et. al.Ioana Schipor ... James N Palmer
01 Jan 2006
American Journal of Rhinology | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extract voice information using high-speed camera

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America