RGB-D data collected from Microsoft Kinect are an easily available media providing the additional depth information besides RGB, which has great potentials in improving the performance of the facial analysis applications. In this paper, we focus on facial descriptor extraction from depth channel. First, we design a facial shape descriptor based on local binary patterns encoding from surface normal. The novelty lies in that it considers local structures involved in both inner component and inter-components of surface normal. Second, we propose a tensor representation for histogram arrays embedded in multidimensional space and further use multilinear principal component analysis to obtain an optimal trade-off between efficacy and efficiency. The experiments conducted on two publicly available databases, named CurtinFaces and Eurecom, have demonstrated the promising results achieved by proposed schemes on identity and ethnicity classification.