Abstract

Action recognition is one of the most active areas in the computer vision community. Many previous works use a two-stream CNN model to capture both spatial and temporal cues for the prediction task. However, the two streams are trained separately and combined only by late fusion, a strategy that overlooks the interaction between spatial and temporal features. In this paper, we propose new two-stream CNN architectures that learn the relation between the two kinds of features and can be trained end-to-end with the standard back-propagation algorithm. We also introduce a Fisher loss that makes the learned features more discriminative. Experiments show that the Fisher loss yields higher accuracy than using only the softmax loss.
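The abstract does not give the exact formulation of the Fisher loss; the sketch below assumes the common Fisher-criterion style regularizer, which penalizes within-class scatter of the features while rewarding between-class scatter. The function name `fisher_loss` and the NumPy formulation are illustrative, not the paper's implementation.

```python
import numpy as np

def fisher_loss(features, labels):
    """Fisher-style regularizer (sketch, not the paper's exact loss):
    within-class scatter minus between-class scatter. Lower values mean
    features of each class are compact and class means are well separated.
    """
    mu = features.mean(axis=0)          # global mean of all features
    s_within, s_between = 0.0, 0.0
    for c in np.unique(labels):
        fc = features[labels == c]      # features belonging to class c
        mc = fc.mean(axis=0)            # class mean
        s_within += ((fc - mc) ** 2).sum()
        s_between += len(fc) * ((mc - mu) ** 2).sum()
    return s_within - s_between
```

In training, a term like this would typically be added to the softmax cross-entropy with a small weight, so the network is pushed toward discriminative features without destabilizing classification.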
