Abstract

This paper addresses the Multi-Athlete Tracking (MAT) problem, which plays a crucial role in sports video analysis. There exist specific challenges in MAT, e.g., athletes share a high similarity in appearance and frequently occlude with each other, making existing approaches not applicable for this task. To address this problem, we propose a novel online multiple athlete tracking approach which make use of long-term temporal pose dynamics for better distinguishing different athletes. Firstly, we design a Pose-based Triple Stream Network (PTSN) based on Long Short-Term Memory (LSTM) networks, capable of modeling long-term temporal pose dynamics of athletes, including pose-based appearance, motion and athletes’ interaction clues. Secondly, we propose a multi-state online matching algorithm based on bipartite graph matching and similarity scores produced by PTSN. It is robust to noisy detections and occlusions due to the reliable transitions of multiple detection states. We evaluate our method on the APIDIS, NCAA Basketball and VolleyTrack databases, and the experiment results demonstrate its effectiveness.

Highlights

  • Sensors 2021, 21, 197. https://In recent years, sports video analysis has received increasing attention in academia and industry due to its scientific challenges and promising applications

  • We design a multi-state online matching algorithm based on bipartite graph matching

  • We evaluate our method by comparing it with recently proposed advanced multiobject trackers on the APIDIS, NCAA Basketball and VolleyTrack databases, and the experiment results demonstrate the effectiveness of our method

Read more

Summary

Introduction

Sports video analysis has received increasing attention in academia and industry due to its scientific challenges and promising applications. In real sports scenes, there exist some specific difficulties: (1) athletes share a high similarity in appearance (dressing, figure, etc), and they frequently occlude with each other; (2) athletes often have abrupt positions and complex actions These facts make the existing MOT methods [6,7,8,9], especially the ones focus on appearance and simple motion clues, lose efficacy. We design a multi-state online matching algorithm based on bipartite graph matching It uses the affinities produced by PTSN to associate the athlete detections frame by frame and accomplish the tracking. Memory (LSTM) networks, capable of modeling long-term temporal pose dynamics of athletes and generating robust association affinities. We design a multi-state online matching algorithm based on multiple detection states and reliable transitions with the association affinities, improves the robustness to noisy detections and occlusion.

Athlete Tracking
Multi-Object Tracking
Human Pose Estimation
Multi-Athlete Tracking Approach
Overall Architecture of PTSN
Multi-State Online Matching Algorithm
1: Initial
Databases
Implementation Details
Evaluation Indexes
Results Analysis
Methods
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call