Abstract

Video event description is an important research topic in video analysis with a vast amount of applications, such as visual surveillance, video retrieval, video annotation, video database indexing, and interactive system. In this paper, we present a framework for automated video event description, which features fused with the context knowledge to provide accurate and reliable event description. The processing framework is designed to describe the event and recognize objects activities composed of four components: object detection, classification, tracking, and semantic event description. Our contribution is to integrate the contextual cues into these components to facilitate the semantic video event description. Furthermore, in the tracking part, a novel adaptive shape kernel based mean shift tracking algorithm is proposed to improve object tracking performance under object deformation and background clutter. In the experiments, we show attractive experimental results, highlighting the system efficiency and tracking capability by using our video event description system on a real-world video for video event understanding application.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.