Abstract
Deep learning approach has becoming a research interest in action recognition application due to its ability to surpass the performance of conventional machine learning approaches. Convolutional Neural Network (CNN) is among the widely used architecture in most action recognition works. There are various models exist in CNN but no research has been done to analyse which model has the best performance in recognizing actions for badminton sport. Hence, in this paper we are comparing the performance of four different pre-trained models of deep CNN in classifying the badminton match images to recognize the different actions done by the athlete. Four models used for comparison are AlexNet, GoogleNet, VggNet-16 and VggNet-19. The images used in this experimental work are categorized into two classes: hit and non-hit action. Firstly, each image frame was extracted from Yonex All England Man Single Match 2017 broadcast video. Then, the image frames were fed as the input to each classifier model for classification. Finally, the performance of each classifier model was evaluated by plotting its performance accuracy in form of confusion matrix. The result shows that the GoogleNet model has the highest classification accuracy which is 87.5% compared to other models. In a conclusion, the pre-trained GoogleNet model is capable to be used in recognizing actions in badminton match which might be useful in badminton sport performance technology.
Highlights
The computer vision field has been widely used in various applications such as video surveillance, human-computer interaction, robotics, object andaction recognition and sport analysis [1, 2]
In this paper, we are comparing the performance of four different established pre-trained models of deep Convolutional Neural Network (CNN) in classifying the badminton match images to recognize the different actions done by the athlete
The pre-trained GoogleNet model is capable to be used in recognizing actions in badminton match which might be useful in badminton sport performance technology
Summary
The computer vision field has been widely used in various applications such as video surveillance, human-computer interaction, robotics, object andaction recognition and sport analysis [1, 2]. Action recognition is a very challenging problem in computer vision field. There are two modalities in action recognition: 1) sensor-based modality and 2) video-based modality. In this new era of technology, where video transmissions are widely available online, video-based modality is increasingly used in recognizing the action.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have