Abstract

Abstract Deep subspace learning (DSL) models based on the principal component analysis network (PCANet) and linear discriminant analysis network (LDANet) have shown to be promising alternatives to deep learning models when there are computing power and training data constraints. However, high dimensionality of the feature space remains a major issue for DSL models. This paper presents a novel DSL approach based on an extended dictionary representation with deep subspace features for facial expression recognition. First, we propose the use of feature pooling with DSL by adding rank-based average pooling between each subspace mapping layer. We then use spatial pyramid pooling in the output layer to overcome the high-dimensionality problem. After that, the extended dictionary is formed by expanding the feature dictionary. Finally, we apply sparse representation classification with squared l2-regularization to improve the recognition accuracy. Comprehensive experiments based on several well-established datasets confirm that our proposed approach has superior performance compared to both the baseline as well as state-of-the-art PCANet and LDANet methods, not just in terms of accuracy but also robustness against block occlusion and random corruption.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call