Abstract

Deep learning approach has becoming a research interest in action recognition application due to its ability to surpass the performance of conventional machine learning approaches. Convolutional Neural Network (CNN) is among the widely used architecture in most action recognition works. There are various models exist in CNN but no research has been done to analyse which model has the best performance in recognizing actions for badminton sport. Hence, in this paper we are comparing the performance of four different pre-trained models of deep CNN in classifying the badminton match images to recognize the different actions done by the athlete. Four models used for comparison are AlexNet, GoogleNet, VggNet-16 and VggNet-19. The images used in this experimental work are categorized into two classes: hit and non-hit action. Firstly, each image frame was extracted from Yonex All England Man Single Match 2017 broadcast video. Then, the image frames were fed as the input to each classifier model for classification. Finally, the performance of each classifier model was evaluated by plotting its performance accuracy in form of confusion matrix. The result shows that the GoogleNet model has the highest classification accuracy which is 87.5% compared to other models. In a conclusion, the pre-trained GoogleNet model is capable to be used in recognizing actions in badminton match which might be useful in badminton sport performance technology.

Highlights

  • The computer vision field has been widely used in various applications such as video surveillance, human-computer interaction, robotics, object andaction recognition and sport analysis [1, 2]

  • In this paper, we are comparing the performance of four different established pre-trained models of deep Convolutional Neural Network (CNN) in classifying the badminton match images to recognize the different actions done by the athlete

  • The pre-trained GoogleNet model is capable to be used in recognizing actions in badminton match which might be useful in badminton sport performance technology

Read more

Summary

Introduction

The computer vision field has been widely used in various applications such as video surveillance, human-computer interaction, robotics, object andaction recognition and sport analysis [1, 2]. Action recognition is a very challenging problem in computer vision field. There are two modalities in action recognition: 1) sensor-based modality and 2) video-based modality. In this new era of technology, where video transmissions are widely available online, video-based modality is increasingly used in recognizing the action.

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.