Action Recognition Process Research Articles

The article formulates the problem of recognizing the movements of objects in a video sequence, describes the stages of its solution, and analyzes the basic methods used at each stage. The wide range of applications and the growing requirements on recognition quality determine the relevance of the study. Action recognition and detection begins with extracting useful features from the input video sequence. The features are then passed to a classifier to identify the action class (for example, running, walking, jumping, or various gestures). The article describes the main feature descriptors. The filter-based category covers the histogram of oriented gradients, the cuboid descriptor, the scale-invariant feature transform, the gradient location-orientation histogram, local trinary patterns, and spatiotemporal patches. The optical flow-based descriptors covered are histograms of optical flow, the motion boundary histogram, and dense trajectories, alongside convolutional neural network-based descriptors. Some algorithms require the extraction of primitive features and further refinement of the auxiliary features before they can be passed to the classifier. Examples of specialized primitive features are methods based on silhouettes or contours and methods based on object tracking. The methods for classifying extracted features include support vector machines, adaptive boosting, artificial neural networks, and convolutional neural networks. The key difficulties that arise in solving the problem are considered. One way to compare the various methods is to evaluate each approach quantitatively on the same dataset with the same protocol. The datasets range from the simple KTH and Weizmann datasets, through the Carnegie Mellon University Crowded Videos dataset and the Microsoft Research Action Group dataset, to the large-scale UCF101 and ActivityNet datasets with more complex video conditions.
Existing approaches to recognizing motion in video sequences are analyzed. The article sets out the characteristics, strengths, and weaknesses of the various methods for detecting features and classifying them. The leading methods, which show the best results, make wide use of convolutional neural networks. One such method is a spatio-temporal graph convolutional neural network for action recognition based on the object's skeleton. A method for further research and improvement was chosen.
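The two-stage process described in this abstract (feature extraction, then classification, then evaluation under a fixed protocol) can be illustrated with a minimal sketch. It is not the article's implementation. The descriptor is a simplified, whole-frame variant of the histogram of oriented gradients, averaged over frames; a nearest-centroid classifier is swapped in for the support vector machines, boosting, and neural networks the article names, purely to keep the example self-contained. All function and class names, the bin count, and the synthetic ramp "videos" are assumptions made for illustration.

```python
import numpy as np


def orientation_histogram(frame, n_bins=9):
    """Magnitude-weighted histogram of unsigned gradient orientations for one frame."""
    gy, gx = np.gradient(frame.astype(float))  # derivatives along rows, then columns
    magnitude = np.hypot(gx, gy)
    angle = np.mod(np.arctan2(gy, gx), np.pi)  # fold orientations into [0, pi)
    hist, _ = np.histogram(angle, bins=n_bins, range=(0.0, np.pi), weights=magnitude)
    total = hist.sum()
    return hist / total if total > 0 else hist


def video_descriptor(video, n_bins=9):
    """Average the per-frame histograms into one fixed-length feature vector."""
    return np.mean([orientation_histogram(f, n_bins) for f in video], axis=0)


class NearestCentroidClassifier:
    """Hypothetical stand-in classifier: assign the class with the closest mean descriptor."""

    def fit(self, features, labels):
        features = np.asarray(features, dtype=float)
        labels = np.asarray(labels)
        self.classes_ = np.unique(labels)
        self.centroids_ = np.stack(
            [features[labels == c].mean(axis=0) for c in self.classes_]
        )
        return self

    def predict(self, features):
        features = np.asarray(features, dtype=float)
        # Euclidean distance from every sample to every class centroid
        distances = np.linalg.norm(
            features[:, None, :] - self.centroids_[None, :, :], axis=2
        )
        return self.classes_[np.argmin(distances, axis=1)]


def accuracy(predicted, expected):
    """Fraction of correctly classified samples (same data, same protocol for every method)."""
    return float(np.mean(np.asarray(predicted) == np.asarray(expected)))


# Synthetic toy data (an assumption): each "video" is a stack of intensity ramps.
ramp = np.tile(np.arange(16.0), (16, 1))          # intensity grows along columns
train_videos = [np.stack([ramp] * 4), np.stack([ramp.T] * 4)]
train_labels = ["class_a", "class_b"]
test_videos = [np.stack([2.0 * ramp] * 3), np.stack([3.0 * ramp.T] * 3)]
test_labels = ["class_a", "class_b"]

clf = NearestCentroidClassifier().fit(
    [video_descriptor(v) for v in train_videos], train_labels
)
predictions = clf.predict(np.array([video_descriptor(v) for v in test_videos]))
score = accuracy(predictions, test_labels)
print(predictions, score)
```

Because the histogram is normalized, the descriptor ignores overall contrast, which is why the rescaled test ramps land on the same centroids as the training ramps. A practical system would compute descriptors on local cells or spatiotemporal patches and evaluate on one of the benchmark datasets named above rather than on synthetic frames.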

Mirror neurons within a monkey's premotor area F5 fire not only when the monkey performs a certain class of actions but also when the monkey observes another monkey (or the experimenter) perform a similar action. It has thus been argued that these neurons are crucial for understanding of actions by others. We offer the hand-state hypothesis as a new explanation of the evolution of this capability: the basic functionality of the F5 mirror system is to elaborate the appropriate feedback - what we call the hand state - for opposition-space based control of manual grasping of an object. Given this functionality, the social role of the F5 mirror system in understanding the actions of others may be seen as an exaptation gained by generalizing from one's own hand to an other's hand. In other words, mirror neurons first evolved to augment the "canonical" F5 neurons (active during self-movement based on observation of an object) by providing visual feedback on "hand state," relating the shape of the hand to the shape of the object. We then introduce the MNS1 (mirror neuron system 1) model of F5 and related brain regions. The existing Fagg-Arbib-Rizzolatti-Sakata model represents circuitry for visually guided grasping of objects, linking the anterior intraparietal area (AIP) with F5 canonical neurons. The MNS1 model extends the AIP visual pathway by also modeling pathways, directed toward F5 mirror neurons, which match arm-hand trajectories to the affordances and location of a potential target object. We present the basic schemas for the MNS1 model, then aggregate them into three "grand schemas" - visual analysis of hand state, reach and grasp, and the core mirror circuit - for each of which we present a useful implementation (a non-neural visual processing system, a multijoint 3-D kinematics simulator, and a learning neural network, respectively). 
With this implementation we show how the mirror system may learn to recognize actions already in the repertoire of the F5 canonical neurons. We show that the connectivity pattern of mirror neuron circuitry can be established through training, and that the resultant network can exhibit a range of novel, physiologically interesting behaviors during the process of action recognition. We train the system on the basis of final grasp but then observe the whole time course of mirror neuron activity, yielding predictions for neurophysiological experiments under conditions of spatial perturbation, altered kinematics, and ambiguous grasp execution which highlight the importance of the timing of mirror neuron activity.

Articles published on Action Recognition Process

Effector-specific motor simulation supplements core action recognition processes in adverse conditions.

Optimal Deep Convolutional Neural Network with Pose Estimation for Human Activity Recognition

Human Action recognition using STIP Evaluation techniques

A Real-Time Wearable Assist System for Upper Extremity Throwing Action Based on Accelerometers.

Action and movements recognition methods

Adaptation aftereffects reveal representations for encoding of contingent social actions

Action recognition is sensitive to the identity of the actor

Sensorimotor Coarticulation in the Execution and Recognition of Intentional Actions.

RegFrame: fast recognition of simple human actions on a stand-alone mobile device

Visual adaptation dominates bimodal visual-motor action adaptation.

Distinct spatio-temporal profiles of beta-oscillations within visual and sensorimotor areas during action recognition as revealed by MEG

Putting Actions in Context: Visual Action Adaptation Aftereffects Are Modulated by Social Contexts

Three-dimensional action recognition using volume integrals

From motor to sensory processing in mirror neuron computational modelling

The neural substrate of gesture recognition

Schema design and implementation of the grasp-related mirror neuron system.
