Abstract
This paper addresses an integrated information mining techniques for multimedia TV-news archive. The utilizes techniques from the fields of acoustic, image, and video analysis, for information retrieval on news story title, newsman and scene identification. The goal is to construct a compact yet meaningful abstraction of broadcast news video, allowing users to browse through large amounts of data in a non-linear fashion with flexibility and efficiency. By using acoustic analysis, the system can classify video into news versus commercials, with 90% accuracy on a data set of 400 hours TV-news recorded off the air from July 2003 to August of 2004. By applying speaker identification and/or image detection techniques, each news stories can be segmented with an accuracy of 96%. On screen captions or subtitles are recognized by OCR techniques to produce the text title of each news stories. The extracted title words can be used to link or to navigate more related News contents on the WWW. In cooperation with facial and scene analysis and recognition techniques, OCR results can provide users with multimodality query for specific news stories. Some experimental results are presented and discussed for the system reliability and performance evaluation and comparison.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.