Abstract. The intelligent processing of human facial expressions has gained popularity due to the growth of big data and the expanding knowledge of AI and machine learning. In this paper, the author reviews the current mainstream methods for face recognition using convolutional neural network (CNN) models in deep learning, providing insights and future directions. By collecting and analyzing research findings, the characteristics, strengths and weaknesses of each model are discussed. The results indicate that CNN-based face recognition systems typically involve several steps: face detection, face alignment, face representation, face matching, and post-processing. By examining expression recognition with models like LeNet-5, AlexNet, VGGNet, and GoogleNet, the author concludes that CNNs possess features such as being data-driven, capable of feature learning, multi-level structured, adaptable to different domains, capable of real-time processing, and supporting multi-task learning. This technology has made significant strides and offers considerable potential for further development.
Read full abstract