Abstract

Temporal action detection in long, untrimmed videos is an important yet challenging task that requires not only recognizing the categories of actions in videos, but also localizing the start and end times of each action. In recent years, artificial neural networks such as the Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) have significantly improved performance on various computer vision tasks, including action detection. In this paper, we make the most of classifiers at different granularities and propose to detect actions from fine to coarse granularity, which is also in line with how people detect actions. Our action detection method is built in the 'proposal then classification' framework. We employ several neural network architectures as deep feature extractors and as segment-level (fine-granular) and window-level (coarse-granular) classifiers. Both the proposal and classification steps are executed from the segment level to the window level. The experimental results show that our method not only achieves detection performance comparable to that of state-of-the-art methods, but also performs in a relatively balanced way across different action categories.

Highlights

  • Video analysis is important for applications ranging from robotics, human-computer interaction to intelligent surveillance

  • The second framework, 'proposal then classification' [3,4], draws inspiration from Region-based Convolutional Neural Networks (R-CNN) for object detection [5] and its upgraded versions [6,7]. It is implemented in two steps: (1) temporal action proposal, which produces a set of windows that are likely to contain an action instance; and (2) action classification, which assigns a specific category to each action proposal

  • We propose to detect actions in video from fine to coarse granularity, which is in line with how people detect actions


Summary

Introduction

Video analysis is important for applications ranging from robotics and human-computer interaction to intelligent surveillance. The second framework, 'proposal then classification' [3,4], draws inspiration from Region-based Convolutional Neural Networks (R-CNN) for object detection [5] and its upgraded versions [6,7]. It is implemented in two steps: (1) temporal action proposal, which produces a set of windows that are likely to contain an action instance; and (2) action classification, which assigns a specific category to each action proposal. Most of the action detection methods mentioned above design fine-granular classifiers; for example, [4] trained a 3D CNN classifier via a multi-task learning method for segment-level proposal, classification, and localization. Both methods [3,4] used post-processing to obtain the final detection results.
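The two-step pipeline described above can be sketched in a few lines. This is a minimal illustration, not the paper's actual networks: `score_window` stands in for a learned proposal (actionness) scorer, and the sliding-window parameters and NMS threshold are hypothetical choices.

```python
def generate_proposals(num_frames, window_sizes, stride, score_window, threshold=0.5):
    """Step 1 (temporal action proposal): slide windows of several sizes
    over the video and keep those whose actionness score passes a threshold.
    score_window(start, end) is a placeholder for a learned scorer."""
    proposals = []
    for size in window_sizes:
        for start in range(0, num_frames - size + 1, stride):
            score = score_window(start, start + size)
            if score >= threshold:
                proposals.append((start, start + size, score))
    return proposals

def temporal_iou(a, b):
    """Intersection-over-union of two temporal windows (start, end)."""
    inter = max(0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0

def nms(proposals, iou_threshold=0.5):
    """Post-processing: standard non-maximum suppression, keeping the
    highest-scoring window among heavily overlapping proposals."""
    kept = []
    for p in sorted(proposals, key=lambda x: x[2], reverse=True):
        if all(temporal_iou(p[:2], k[:2]) < iou_threshold for k in kept):
            kept.append(p)
    return kept
```

Step 2 (action classification) would then run a category classifier on each surviving window; here it is omitted since the highlight above only fixes the proposal/classification split, not the classifier architecture.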

Overview
Previous
Our Method
Overview of Our Method
Res3D Architecture
Discriminative temporal search
Regression Network
Datasets and Evaluation Metrics
Experiments
Implementation Details
Exploratory Study
Comparison with State-of-the-Art Methods
Limitation
Findings
Conclusions