Abstract
Significant progress has been made in individual livestock recognition based on convolutional neural networks (CNN), however, their performance still needs improvement. Vision transformer (ViT) emerged as a cutting-edge approach, which has been successfully applied in many tasks of the computer vision field. The superior performance of ViT motivates us to study whether ViT can provide more accurate results for sheep face recognition.In this study, we propose MobileViTFace for sheep face recognition. MobileViTFace is a lightweight sheep face recognition model which combines the convolutional and transformer structures. Compared with the standard ViT model, MobileViTFace does not require too much training data and high computational complexity and is more convenient to deploy on edge devices. Extensive benchmarking tests illustrate that MobileViTFace can secure competitive performance, which achieved 97.13% recognition accuracy on 7,434 sheep face images containing 186 sheep, significantly better than lightweight models based on convolutional structures such as MobileNet, EfficientNet, etc. Parameters and floating-point operations (FLOPs) are reduced by five times compared to ResNet-50, which has similar recognition accuracy. Real-time and accurate recognition results are obtained on the Jetson Nano-based edge computing platform, which is helpful for practical production.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.