• A method of human key-point coordinate representation based on probability distribution perception. • Addressing issues :key-point representation quantization error , algorithm inconsistency. • This method gives a high level of precision in the experiment. Existing human pose estimation methods have some problems, such as key-point representation quantization error, algorithm inconsistency, and easy-to-introduce unequal quantization error. Addressing these issues, this paper proposes a method of human key-point coordinate representation based on probability distribution perception on a depth convolution neural network . In this method, the perceived probability of the key-point coordinate distribution is extracted by a convolutional neural network , and the cross entropy is formed with the prediction probability distribution as the loss function. Under the constraint of minimizing the Kullback-Leibler (KL) distance between the prediction and the ground truth, the convolution neural network is optimized, and the coordinates of the heat map are finally located. The experimental results show that compared with the traditional method based on a heat map, the proposed method can accelerate the reasoning speed of the model in the COCO training set, and at the same time, it can improve the map by 1.7
Read full abstract