Human Action Recognition and Note Recognition: A Deep Learning Approach Using STA-GCN.

Avirmed Enkhbat, Pimpa Cheewaprakobkit, Timothy K Shih

doi:10.3390/s24082519

Abstract

Human action recognition (HAR) is a growing area of machine learning with a wide range of applications. One challenging case is recognizing the actions of a musician during a performance, which is further complicated by the need to recognize the musical notes being played. This paper proposes a deep learning-based method for simultaneous HAR and musical note recognition in music performances. We conducted experiments on performances of the Morin khuur, a traditional Mongolian instrument. The proposed method consists of two stages. First, we created a new dataset of Morin khuur performances, using motion capture systems and depth sensors to collect data that includes hand keypoints, instrument segmentation information, and detailed movement information. We then analyzed RGB images, depth images, and motion data to determine which type of data provides the most valuable features for recognizing actions and notes in music performances. The second stage uses a Spatial Temporal Attention Graph Convolutional Network (STA-GCN) to recognize musical notes as continuous gestures. The STA-GCN model is designed to learn the relationships between hand keypoints and instrument segmentation information, which are crucial for accurate recognition. Evaluation on our dataset shows that our model outperforms the traditional ST-GCN model, achieving an accuracy of 81.4%.
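To make the idea of an attention-augmented graph convolution over keypoints more concrete, the following is a minimal NumPy sketch of one spatial layer. It is not the authors' STA-GCN implementation: the 21-keypoint chain skeleton, the feature sizes, the random weights, and the choice to add a softmax attention map to the normalized adjacency are all assumptions made for illustration, and the function names are hypothetical.

```python
import numpy as np


def normalize_adjacency(adj):
    """Add self-loops and apply symmetric normalization D^-1/2 (A + I) D^-1/2."""
    a_hat = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def attention_graph_conv(x, adj_norm, w_feat, w_query, w_key):
    """One spatial graph convolution with a learned attention map.

    x has shape (T, V, C_in): T frames, V keypoints, C_in features per keypoint.
    Returns the layer output (T, V, C_out) and the attention maps (T, V, V).
    """
    q = x @ w_query                                   # (T, V, D)
    k = x @ w_key                                     # (T, V, D)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(q.shape[-1])
    attn = softmax(scores, axis=-1)                   # each row sums to 1
    # Assumption: the fixed skeleton graph and the attention map are summed.
    mixed = (adj_norm[None] + attn) @ x               # (T, V, C_in)
    return np.maximum(mixed @ w_feat, 0.0), attn      # ReLU activation


# Hypothetical input: 8 frames, 21 keypoints, 3-D coordinates per keypoint.
rng = np.random.default_rng(0)
T, V, C_IN, C_OUT, D = 8, 21, 3, 4, 5

# Hypothetical skeleton: a simple chain linking consecutive keypoints.
adj = np.zeros((V, V))
idx = np.arange(V - 1)
adj[idx, idx + 1] = 1.0
adj[idx + 1, idx] = 1.0
adj_norm = normalize_adjacency(adj)

x = rng.normal(size=(T, V, C_IN))
w_feat = rng.normal(size=(C_IN, C_OUT))
w_query = rng.normal(size=(C_IN, D))
w_key = rng.normal(size=(C_IN, D))

out, attn = attention_graph_conv(x, adj_norm, w_feat, w_query, w_key)
print(out.shape, attn.shape)
```

In a full spatial-temporal model, a layer like this would typically be followed by a convolution along the time axis and trained end to end; the sketch covers only the spatial step on random inputs.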

Journal: Sensors	Publication Date: Apr 14, 2024
Citations: 1	License type: CC BY 4.0

Similar Papers

A Self-Attention Augmented Graph Convolutional Clustering Networks for Skeleton-Based Video Anomaly Behavior Detection
Chengming Liu ... Weiwei Li
Applied Sciences | VOL. 12
21 Dec 2021

A spatial attentive and temporal dilated (SATD) GCN for skeleton‐based action recognition
Jiaxu Zhang ... Yongtao Qin
CAAI Transactions on Intelligence Technology | VOL. 7
17 Mar 2021

Spatial Temporal Variation Graph Convolutional Networks (STV-GCN) for Skeleton-Based Emotional Action Recognition
Ming-Fong Tsai ... Chiung-Hung Chen
IEEE Access | VOL. 9
01 Jan 2020

Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition.
Fanjia Li ... Juanjuan Li
Sensors | VOL. 20
15 Sep 2020
