Speech recognition for a digital video library

Michael J Witbrock, Alexander G Hauptmann

doi:10.1002/(sici)1097-4571(1998)49:7<619::aid-asi4>3.0.co;2-1

Abstract

The standard method for making the full content of audio and video material searchable is to annotate it with human-generated meta-data that describes the content in a way that the search can understand, as is done in the creation of multimedia CD-ROMs. However, for the huge amounts of data that could usefully be included in digital video and audio libraries, the cost of producing this meta-data is prohibitive. In the Informedia Digital Video Library, the production of the meta-data supporting the library interface is automated using techniques derived from artificial intelligence (AI) research. By applying speech recognition together with natural language processing, information retrieval, and image analysis, an interface has been produced that helps users locate the information they want, and navigate or browse the digital video library more effectively. Specific interface components include automatic titles, filmstrips, video skims, word location marking, and representative frames for shots. Both the user interface and the information retrieval engine within Informedia are designed for use with automatically derived meta-data, much of which depends on speech recognition for its production. Some experimental information retrieval results will be given, supporting a basic premise of the Informedia project: that transcripts generated by speech recognition can make multimedia material searchable. The Informedia project emphasizes the integration of speech recognition, image processing, natural language processing, and information retrieval to compensate for deficiencies in these individual technologies. © 1998 John Wiley & Sons, Inc.
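
The premise that recognizer transcripts make video searchable, together with the word location marking component, can be illustrated with a minimal sketch. It is not the Informedia implementation: the transcript data, the tuple layout, and the function names `build_index` and `search` are all assumptions made for illustration. The sketch builds an inverted index from time-stamped recognized words and returns, for each matching video, the times at which the query words were spoken.

```python
from collections import defaultdict

# Hypothetical recognizer output: (video_id, start_seconds, word).
# These values are invented for illustration; they are not from the paper.
TRANSCRIPT = [
    ("news_01", 0.0, "the"),
    ("news_01", 0.4, "space"),
    ("news_01", 0.9, "shuttle"),
    ("news_01", 1.5, "launched"),
    ("doc_02", 3.2, "shuttle"),
    ("doc_02", 3.8, "bus"),
]


def build_index(words):
    """Map each lowercased word to the (video_id, start_seconds) pairs where it occurs."""
    index = defaultdict(list)
    for video_id, start, word in words:
        index[word.lower()].append((video_id, start))
    return index


def search(index, query):
    """Return {video_id: sorted hit times} for videos that contain every query word."""
    terms = query.lower().split()
    if not terms:
        return {}
    hits = defaultdict(list)
    videos_per_term = []
    for term in terms:
        postings = index.get(term, [])  # .get avoids creating empty entries
        videos_per_term.append({video_id for video_id, _ in postings})
        for video_id, start in postings:
            hits[video_id].append(start)
    # Keep only videos in which all query terms were recognized.
    matching = set.intersection(*videos_per_term)
    return {video_id: sorted(hits[video_id]) for video_id in matching}


if __name__ == "__main__":
    index = build_index(TRANSCRIPT)
    print(search(index, "space shuttle"))
```

The returned times are what an interface could use to mark word locations on a video timeline. A real system would also have to cope with recognition errors, which this exact-match sketch ignores.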

Journal: Journal of the American Society for Information Science
Publication Date: Jan 1, 1998
Citations: 37

Similar Papers

Speech recognition for a digital video library
Michael J Witbrock ... Alexander G Hauptmann
Computer Standards & Interfaces | VOL. 20
01 Mar 1999

Artificial intelligence techniques in the interface to a Digital Video Library
Alexander G Hauptmann ... Michael J Witbrock
01 Jan 1997

Addressing the challenge of visual information access from digital image and video libraries
Michael G Christel ... Ronald M Conescu
07 Jun 2005

Automated Video Indexing of Very Large Video Libraries
H D Wactlar ... A G Hauptmann
SMPTE Journal | VOL. 106
01 Aug 1997
