Abstract

This paper addresses the problem of action recognition with improved dense trajectories (IDT). Recently, IDT achieved a significant performance in action recognition with realistic videos. However, the efficiency of storage and the speed of classification are limited due to the dense samples in feature space. To address this issue, the intuitive way is to reduce the dimension and adopt a fast classification method. Therefore, we explore the influence of dimensionality reduction on the recognition rate. In addition, Extreme Learning Machine (ELM) is adopted to further improve classification efficiency. We present performance on the KTH, UCF11, HMDB51, and UCF101 datasets in all kinds of situations such as the different fusion methods, the different dimensionality reduction, and different learning methods. As a result, it can be observed that ELM with principal components analysis (PCA) improves the performance in terms of mean average precision (mAP) which not only significantly reduces computational cost but improves accuracy. What's more, the training and testing time decrease 1–2 orders of magnitude without losing accuracy when Fisher vector (FV) adopts reduction techniques before it fed into classifier.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.