Exploiting Local Feature Fusion for Action Recognition

Jie Miao,Haoyu Huang,Chunmei Qing,Xiangmin Xu,Bolun Cai,Xiaofen Xing,Xiaoyi Jia

doi:10.1007/978-3-319-48896-7_22

Abstract

Densely sampled local features with bag-of-words models have been widely applied to action recognition. Conventional approaches assume that different kinds of local features are totally uncorrelated, and they are separately processed, encoded, and then fused at video-level representation. However, these local features are not totally uncorrelated in practice. To address this problem, multi-view local feature fusion is exploited for local descriptor fusion in action recognition. Specifically, tensor canonical correlation analysis (TCCA) is employed to obtain a fused local feature that carries the high-order correlation hidden among different types of local features. The high-order correlation local feature improves the conventional concatenation based fusion approach. Experimental results on three challenging action recognition datasets validate the effectiveness of the proposed approach.

Full Text