Abstract

In this paper, we propose a novel Spatiotemporal Interest Point (MC-STIP) detector based on the coherent motion pattern around each voxel in videos. Our detector defines the local peaks of optical flow as the interest points in the motion coherence volumes of videos. A concatenating histogram of 2D gradients is introduced to describe each interest point as the descriptor. Moreover, we introduce a Topic Matrix Video Representation (T-Mat) for videos. Our representation not only captures the global hidden topics but also preserves the shared discriminative information among the interest point descriptors. We conduct our experiments on three benchmark datasets to recognize human actions using Support Vector Machines with four different kernels. The experiments demonstrate the effectiveness of our new approach.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.