Abstract

Human action recognition in still images provides useful information for use in a wide range of computer vision applications. Because motion information cannot be estimated from a single image, action recognition in still images remains a challenging problem.In this paper, we propose poselet based action recognition methods to infer human action using a two-layer classification model. First, poselets are built using the annotated data of joint locations of people and the proposed Hausdorff distance. Each poselet, consisting of a feature vector, is trained using the first-layer classifier based on random forest classification to find the proper location. From trained poselet detectors, we construct spatial poselet activation vectors (SPAVs) using the voting scores of poselets. A second-layer classifier, which takes aggregating SPAVs of the first-layer classifiers as input, trains a final multi-class classifier. During the testing phase, the input window, which includes a human region, is applied to the first-layer classifier; the aggregating output of the first-layer is applied to the second-layer classifier. After calculating scores for all the c-action classes, the final action class is selected as the one that has the maximum score. Experimental results showed that the recognition performance and processing times of the proposed method was better than those of previous methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call