MAPoseNet: Animal pose estimation network via multi-scale convolutional attention

Sicong Liu, Qingcheng Fan, Shuqin Li, Chunjiang Zhao

doi:10.1016/j.jvcir.2023.103989

Abstract

Animal pose estimation is an upstream task for recognizing and understanding animal behavior. In recent years, the accuracy of deep learning-based methods has steadily improved, but at the expense of inference speed. This paper proposes an efficient model that improves both inference speed and accuracy. The model, named MAPoseNet (Animal Pose Estimation Network via Multi-scale Convolutional Attention), follows the classic encoder–decoder architecture and is built on a feature pyramid and a multi-scale asymmetric convolution attention mechanism. Rather than typical self-attention, the encoder’s attention mechanism uses multi-scale asymmetric convolutions, which are lightweight and help improve inference speed. The decoder consists of a feature pyramid and a feature balance module. MAPoseNet is trained and tested on the public AP-10K dataset. Experimental results show that MAPoseNet achieves state-of-the-art performance: it outperforms HRFormer by 1.3 AP and 0.8 AR, with 33.7% fewer FLOPs and 66% faster inference. The model also surpasses HRNet and HRFormer on the Animal Pose dataset. MAPoseNet thus gains in both inference speed and accuracy rather than trading one for the other.
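The abstract does not give the layer details of the attention mechanism, so the following is only a minimal sketch of the general idea of multi-scale asymmetric (strip) convolutional attention, not the authors' implementation. It assumes depthwise 1×k and k×1 convolutions at several scales whose summed response reweights the input element-wise. The function names `strip_conv` and `multi_scale_attention`, the kernel sizes 7, 11 and 21, and the random weights are all hypothetical choices made for illustration; in a trained network the weights would be learned.

```python
import numpy as np

def strip_conv(x, kernel, axis):
    """Depthwise 1-D cross-correlation of x (C, H, W) along `axis`, zero padded.

    `kernel` has shape (C, k) with odd k; the output has the same shape as x.
    """
    c, k = kernel.shape
    pad = k // 2
    pad_width = [(0, 0)] * 3
    pad_width[axis] = (pad, pad)
    xp = np.pad(x, pad_width)
    out = np.zeros_like(x, dtype=float)
    n = x.shape[axis]
    for i in range(k):
        # Window of the padded input shifted by i along the chosen axis.
        idx = [slice(None)] * 3
        idx[axis] = slice(i, i + n)
        out += kernel[:, i].reshape(c, 1, 1) * xp[tuple(idx)]
    return out

def multi_scale_attention(x, kernels):
    """Hypothetical attention: sum of strip-conv branches, multiplied into x.

    `kernels` maps a size k to a pair (horizontal, vertical) of (C, k) arrays.
    """
    attn = np.zeros_like(x, dtype=float)
    for k, (k_h, k_v) in kernels.items():
        # A 1xk pass followed by a kx1 pass covers a kxk window with
        # 2k weights per channel instead of k*k.
        attn += strip_conv(strip_conv(x, k_h, axis=2), k_v, axis=1)
    return attn * x  # element-wise reweighting of the input features

# Illustrative usage with random (untrained) weights; sizes are assumptions.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 16, 16))
kernels = {k: (rng.standard_normal((4, k)), rng.standard_normal((4, k)))
           for k in (7, 11, 21)}
y = multi_scale_attention(x, kernels)
```

The sketch shows why such a mechanism can be lightweight: each branch costs a number of weights that grows linearly with the kernel size, whereas a full square kernel grows quadratically and self-attention grows with the square of the number of spatial positions.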

Journal: Journal of Visual Communication and Image Representation
Publication Date: Nov 25, 2023
Citations: 1

Similar Papers

Pre-Inpainting Convolutional Skip Triple Attention Segmentation Network for AGV Lane Detection in Overexposure Environment
Zongxin Yang ... Jiemin Hu
Applied Sciences | VOL. 12
21 Oct 2022

Fully automatic MRI brain tumor segmentation using efficient spatial attention convolutional networks with composite loss
Indrajit Mazumdar ... Jayanta Mukherjee
Neurocomputing | VOL. 500
18 May 2022

Patient-independent seizure detection based on long-term iEEG and a novel lightweight CNN
Xiaopeng Si ... Shaoya Yin
Journal of Neural Engineering | VOL. 20
01 Feb 2023

Deep domain adaptation, pseudo-labeling, and shallow network for accurate and fast gait prediction of unlabeled datasets
Jaeyoung Na ... Woochul Nam
IEEE Transactions on Neural Systems and Rehabilitation Engineering | VOL. PP
01 Jan 2023
