Abstract
While deep learning has been widely used for video analytics, such as video classification and action detection, dense action detection of fast-moving subjects in sports videos remains challenging. In this work, we release yet another sports video benchmark, P2ANet, for Ping-Pong Action detection, which consists of 2,721 video clips collected from broadcast videos of professional table tennis matches at the World Table Tennis Championships and Olympiads. Working with a crew of table tennis professionals and referees on a specially designed annotation toolbox, we obtained fine-grained action labels (in 14 classes) for every ping-pong action that appears in the dataset, and we formulate two action detection problems: action localization and action recognition. We evaluate a number of widely used action recognition models (e.g., TSM, TSN, Video Swin Transformer, and SlowFast) and action localization models (e.g., BSN, BSN++, BMN, and TCANet) on P2ANet for both problems under various settings. These models achieve at most 48% area under the AR-AN curve for localization and 82% top-1 accuracy for recognition, since ping-pong actions are dense and their subjects fast-moving while the broadcast videos run at only 25 FPS. The results confirm that P2ANet remains challenging and can serve as a special benchmark for dense action detection from videos. We invite readers to examine our dataset at the following link: https://github.com/Fred1991/P2ANET.
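For readers unfamiliar with the localization metric above, the following is a minimal sketch, not the evaluation code used in the paper, of how the area under the AR-AN curve (average recall versus average number of retained proposals per video) is typically computed for temporal action localization benchmarks; the function names, IoU thresholds, and proposal cap below are illustrative assumptions.

```python
import numpy as np

def temporal_iou(proposal, gt):
    """Temporal IoU between a proposal (start, end) and a ground-truth segment."""
    inter = max(0.0, min(proposal[1], gt[1]) - max(proposal[0], gt[0]))
    union = max(proposal[1], gt[1]) - min(proposal[0], gt[0])
    return inter / union if union > 0 else 0.0

def ar_at_an(proposals, ground_truths, an,
             iou_thresholds=np.arange(0.5, 1.0, 0.05)):
    """Average recall over IoU thresholds, keeping the top-`an` proposals per video.

    proposals:     {video_id: [(start, end), ...]}, sorted by confidence, descending.
    ground_truths: {video_id: [(start, end), ...]}.
    """
    recalls = []
    for t in iou_thresholds:
        matched = total = 0
        for vid, gts in ground_truths.items():
            kept = proposals.get(vid, [])[:an]  # top-AN proposals for this video
            for gt in gts:
                total += 1
                if any(temporal_iou(p, gt) >= t for p in kept):
                    matched += 1
        recalls.append(matched / max(total, 1))
    return float(np.mean(recalls))

def auc_ar_an(proposals, ground_truths, max_an=100):
    """Area under the AR-AN curve for AN = 1..max_an, normalized to [0, 1]."""
    ars = [ar_at_an(proposals, ground_truths, an) for an in range(1, max_an + 1)]
    # Trapezoidal rule with unit AN spacing, normalized by the AN range.
    area = sum((a + b) / 2.0 for a, b in zip(ars, ars[1:]))
    return area / (max_an - 1)
```

Under these assumptions, `auc_ar_an(proposals, ground_truths)` returns a value in [0, 1], so the 48% figure reported above corresponds to an AUC of roughly 0.48.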