Abstract
In this paper, we propose a supervised dictionary learning algorithm for action recognition in still images followed by a discriminative weighting model. The dictionary is learned based on Local Fisher Discrimination which takes into account the local manifold structure and discrimination information of local descriptors. The label information of local descriptors is considered in both dictionary learning and sparse coding stage which generates a supervised sparse coding algorithm and makes the coding coefficients discriminative. Instead of using spatial pyramid features, sliding window-based features with max-pooling are computed from coding coefficients. And then a discriminative weighting model combining a max-margin classifier is proposed using the features. Both the weighting coefficients and model parameters can be jointly learned using the same way in Multiple Kernel Learning algorithm. We validate our model on the following action recognition datasets: Willow 7 human actions dataset, People Playing Music Instrument (PPMI) dataset, and Sports dataset. To show the generality of our model, we also validate it on Scene15 dataset. The experiment results show that only with single scale local descriptors, our algorithm is comparable to some state-of-the-art algorithms.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.