Accelerate Literature Icon
Want to do a literature review? Try our new Literature Review workflow

Surface EMG-Based Hand Gesture Recognition Using a Hybrid Multistream Deep Learning Architecture.

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon

Surface electromyography (sEMG) enables non-invasive measurement of muscle activity for applications such as human-machine interaction, rehabilitation, and prosthesis control. However, high noise levels, inter-subject variability, and the complex nature of muscle activation hinder robust gesture classification. This study proposes a multistream hybrid deep-learning architecture for the FORS-EMG dataset to address these challenges. The model integrates Temporal Convolutional Networks (TCN), depthwise separable convolutions, bidirectional Long Short-Term Memory (LSTM)-Gated Recurrent Unit (GRU) layers, and a Transformer encoder to capture complementary temporal and spectral patterns, and an ArcFace-based classifier to enhance class separability. We evaluate the approach under three protocols: subject-wise, random split without augmentation, and random split with augmentation. In the augmented random-split setting, the model attains 96.4% accuracy, surpassing previously reported values. In the subject-wise setting, accuracy is 74%, revealing limited cross-user generalization. The results demonstrate the method's high performance and highlight the impact of data-partition strategies for real-world sEMG-based gesture recognition.

Similar Papers
  • Research Article
  • Cite Count Icon 17
  • 10.3390/s24175631
A Novel TCN-LSTM Hybrid Model for sEMG-Based Continuous Estimation of Wrist Joint Angles.
  • Aug 30, 2024
  • Sensors (Basel, Switzerland)
  • Jiale Du + 4 more

Surface electromyography (sEMG) offers a novel method in human-machine interactions (HMIs) since it is a distinct physiological electrical signal that conceals human movement intention and muscle information. Unfortunately, the nonlinear and non-smooth features of sEMG signals often make joint angle estimation difficult. This paper proposes a joint angle prediction model for the continuous estimation of wrist motion angle changes based on sEMG signals. The proposed model combines a temporal convolutional network (TCN) with a long short-term memory (LSTM) network, where the TCN can sense local information and mine the deeper information of the sEMG signals, while LSTM, with its excellent temporal memory capability, can make up for the lack of the ability of the TCN to capture the long-term dependence of the sEMG signals, resulting in a better prediction. We validated the proposed method in the publicly available Ninapro DB1 dataset by selecting the first eight subjects and picking three types of wrist-dependent movements: wrist flexion (WF), wrist ulnar deviation (WUD), and wrist extension and closed hand (WECH). Finally, the proposed TCN-LSTM model was compared with the TCN and LSTM models. The proposed TCN-LSTM outperformed the TCN and LSTM models in terms of the root mean square error (RMSE) and average coefficient of determination (R2). The TCN-LSTM model achieved an average RMSE of 0.064, representing a 41% reduction compared to the TCN model and a 52% reduction compared to the LSTM model. The TCN-LSTM also achieved an average R2 of 0.93, indicating an 11% improvement over the TCN model and an 18% improvement over the LSTM model.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 23
  • 10.3390/electronics11101516
A Multivariate Temporal Convolutional Attention Network for Time-Series Forecasting
  • May 10, 2022
  • Electronics
  • Renzhuo Wan + 4 more

Multivariate time-series forecasting is one of the crucial and persistent challenges in time-series forecasting tasks. As a kind of data with multivariate correlation and volatility, multivariate time series impose highly nonlinear time characteristics on the forecasting model. In this paper, a new multivariate time-series forecasting model, multivariate temporal convolutional attention network (MTCAN), based on a self-attentive mechanism is proposed. MTCAN is based on the Convolution Neural Network (CNN) model, using 1D dilated convolution as the basic unit to construct asymmetric blocks, and then, the feature extraction is performed by the self-attention mechanism to finally obtain the prediction results. The input and output lengths of this network can be determined flexibly. The validation of the method is carried out with three different multivariate time-series datasets. The reliability and accuracy of the prediction results are compared with Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Convolutional Long Short-Term Memory (ConvLSTM), and Temporal Convolutional Network (TCN). The prediction results show that the model proposed in this paper has significantly improved prediction accuracy and generalization.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 12
  • 10.3390/s20041223
Deep Neural Networks for the Classification of Pure and Impure Strawberry Purees.
  • Feb 23, 2020
  • Sensors
  • Zhong Zheng + 4 more

In this paper, a comparative study of the effectiveness of deep neural networks (DNNs) in the classification of pure and impure purees is conducted. Three different types of deep neural networks (DNNs)—the Gated Recurrent Unit (GRU), the Long Short Term Memory (LSTM), and the temporal convolutional network (TCN)—are employed for the detection of adulteration of strawberry purees. The Strawberry dataset, a time series spectroscopy dataset from the UCR time series classification repository, is utilized to evaluate the performance of different DNNs. Experimental results demonstrate that the TCN is able to obtain a higher classification accuracy than the GRU and LSTM. Moreover, the TCN achieves a new state-of-the-art classification accuracy on the Strawberry dataset. These results indicates the great potential of using the TCN for the detection of adulteration of fruit purees in the future.

  • Research Article
  • Cite Count Icon 146
  • 10.1016/j.energy.2022.125872
A comparative study of different deep learning algorithms for lithium-ion batteries on state-of-charge estimation
  • Oct 26, 2022
  • Energy
  • Shanshan Guo + 1 more

A comparative study of different deep learning algorithms for lithium-ion batteries on state-of-charge estimation

  • Book Chapter
  • Cite Count Icon 6
  • 10.1007/978-3-030-55180-3_28
Comparison of Hybrid Recurrent Neural Networks for Univariate Time Series Forecasting
  • Aug 25, 2020
  • Anibal Flores + 2 more

The work presented in this paper aims to improve the accuracy of forecasting models in univariate time series, for this it is experimented with different hybrid models of two and four layers based on recurrent neural networks such as Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU). It is experimented with two time series corresponding to downward thermal infrared and all sky insolation incident on a horizontal surface obtained from NASA’s repository. In the first time series, the results achieved by the two-layer hybrid models (LSTM + GRU and GRU + LSTM) outperformed the results achieved by the non-hybrid models (LSTM + LSTM and GRU + GRU); while only two of six four-layer hybrid models (GRU + LSTM + GRU + LSTM and LSTM + LSTM + GRU + GRU) outperformed non-hybrid models (LSTM + LSTM + LSTM + LSTM and GRU + GRU + GRU + GRU). In the second time series, only one model (LSTM + GRU) of two hybrid models outperformed the two non-hybrid models (LSTM + LSTM and GRU + GRU); while the four-layer hybrid models, none could exceed the results of the non-hybrid models.

  • Research Article
  • Cite Count Icon 28
  • 10.1109/tase.2024.3388709
Learning an Enhanced TCN-LSTM Network for Temperature Process Modeling in Rotary Kilns
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Xiaoming Wang + 5 more

Accurate prediction of reaction temperature in rotary kiln is essential to realize its advanced process control and operational optimizations. However, the complexity of the physical and chemical reactions in the rotary kiln makes it difficult for the traditional mechanism model to characterize the dynamic kiln process. In this study, a deep learning-based temperature prediction model is proposed to accurately track the temperature changes during the production process of rotary kiln. The proposed model integrates a temporal convolutional network (TCN) with a long short-term memory (LSTM) network. The former enables the proposed model to be aware of local context and can effectively compute local characteristics, whereas the latter with its time-memory capability can better capture the long-term dependencies of data to extract deep-level features. To enhance the prediction accuracy, the proposed model is further improved by introducing a novel feature fusion and hybrid pooling layer to merge the original input with TCN output features, which efficiently preserve the detailed information and significantly reduce model complexity. Attention mechanism is also incorporated after the LSTM network to concentrate on the key moment information and improve the model performance. Monitoring data from a zinc rotary kiln at a field site is used for model training and testing. Results demonstrate that the proposed model can achieve the best mean square error, 0.165, exhibiting a promising prediction accuracy of the rotary kiln’s temperature. It outperforms the state-of-the-art machine learning-based prediction models such as LSTM, Gated Recurrent Unit (GRU) and TCN. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Note to Practitioners</i> —This work considers high-accuracy prediction of the temperature changes during the production process of rotary kiln. It is critical to well predict kiln tail temperature by capturing long-term dependencies and extracting global and local features from time series data. Yet, this is an extremely challenging task because rotary kiln operation is a complex thermal process with multivariate, pure hysteresis, strong interference, nonlinearity and strong coupling characteristics. As a result, these issues inhibit the establishment of a model that can accurately characterize the production process of the rotary kiln and impedes the realization of automation in volatile kiln production. Current prediction methods usually adopt mechanism-based and data-driven, there are suffer from long computation time, unknown kinetic parameters and poor accuracy. Thus, they fail to effectively learn features and accurately establish model. This work proposes a temperature prediction dynamic model named enhanced TCN-LSTM, which combines a temporal convolutional network (TCN) with a long short-term memory (LSTM) network, incorporating a unique feature fusion technique along with a hybrid pooling layer and attention layer. This integration enables the model to extract both global and local features, thereby preserving detailed information efficiently while significantly reducing model complexity and enhance the stability and robustness. Experimental results based on the zinc rotary kiln dataset demonstrate that the proposed model exhibits a promising prediction accuracy of the rotary kiln’s temperature. In the future, it can be readily implemented and applied in many industrial areas such as systems modeling, time series forecasting, and predictive maintenance.

  • Research Article
  • Cite Count Icon 3
  • 10.1186/s40537-025-01207-5
Cloud based real-time multivariate multi-step prediction of systolic blood pressure and heart rate using temporal convolutional network and Apache Spark
  • Jul 16, 2025
  • Journal of Big Data
  • Hager Saleh + 5 more

Systolic blood pressure (SBP) and heart rate (HR) are critical for proactive clinical decision-making, particularly in intensive care environments. Early identification of abnormalities in these vital signs helps patients receive appropriate treatment and reduce associated health degradation. This study presents FCMS-iDMM, a fog/cloud-based real-time forecasting system that simultaneously predicts future SBP and HR values using deep learning models integrated with big data streaming platforms. The research involves two main phases: offline model and online forecasting pipeline optimization. During the offline model development, we explore the single-task and multi-task modeling. Different optimization steps have been explored. The single task includes forecasting HR and SBP in multi-step heads using Temporal Convolutional Networks (TCN), Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU). Multi-task comprises forecasting HR and SBP in diverse multi-step heads as puerperal employing TCN, sequence-to-sequence (seq2seq), and Autoencoder models using LSTM and GRU. Extensive results are accomplished by the Medical Information Mart for Intensive Care III (MIMIC III) to assess the performance of the proposed multi-task DL model. Multi-task learning (MTL) models based on Temporal Convolutional Networks (TCNs) achieved superior performance. In forecasting 8 min, TCN recorded the best performance compared to other models with 1.5428 RMSE, 1.0871 MAE, and 1.269 MAPE for HR and 4.1446 RMSE, 2.4323 MAE and 2.5237 MAPE for SBP in multi-task. Simulated sensors, Apache Kafka, and Apache Spark are used to develop the real-time HR and SBP online forecasting pipeline. Experimental validation using the MIMIC-III database confirmed that multi-task models outperform single-task approaches across multiple forecasting horizons. The proposed system offers a scalable and efficient solution for real-time monitoring of vital signs, paving the way for predictive, patient-centered healthcare systems.

  • Research Article
  • 10.1016/j.atech.2026.101908
Temporal deep learning for satellite-based sugarcane monitoring and cycle age estimation
  • Mar 1, 2026
  • Smart Agricultural Technology
  • Luan Pedro Souza Silva + 1 more

Temporal deep learning for satellite-based sugarcane monitoring and cycle age estimation

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 1
  • 10.3390/app14198737
Multi-Scale Price Forecasting Based on Data Augmentation
  • Sep 27, 2024
  • Applied Sciences
  • Ting Yue + 1 more

When considering agricultural commodity transaction data, long sampling intervals or data sparsity may lead to small samples. Furthermore, training on small samples can lead to overfitting and makes it hard to capture the fine-grained fluctuations in the data. In this study, a multi-scale forecasting approach combined with a Generative Adversarial Network (GAN) and Temporal Convolutional Network (TCN) is proposed to address the problems related to small sample prediction. First, a Time-series Generative Adversarial Network (TimeGAN) is used to expand the multi-dimensional data and t-SNE is utilized to evaluate the similarity between the original and synthetic data. Second, a greedy algorithm is exploited to calculate the information gain, in order to obtain important features, based on XGBoost. Meanwhile, TCN residual blocks and dilated convolutions are used to tackle the issue of gradient disappearance. Finally, an attention mechanism is added to the TCN, which is beneficial in terms of improving the forecasting accuracy. Experiments are conducted on three products, garlic, ginger and chili. Taking garlic as an example, the RMSE of the proposed method was reduced by 1.7% and 1% when compared to the SVR and RF models, respectively. Its R2 accuracy was also improved (by 4.3% and 3.4%, respectively). Furthermore, TCN-attention and TCN were found to require less time compared to GRU and LSTM. The accuracy of the proposed method increased by about 5% when compared to that without TimeGAN in the ablation study. Moreover, compared with TCN, the Gated Recurrent Unit (GRU), and the Long Short-term Memory (LSTM) model in the multi-scale price forecasting task, the proposed method can better utilize small samples and high-dimensional data, leading to improved performance. Additionally, the proposed model is compared to the Transformer and TimesNet models in terms of its accuracy, deployment cost, and other metrics.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 5
  • 10.3390/app142210461
Robust Human Activity Recognition for Intelligent Transportation Systems Using Smartphone Sensors: A Position-Independent Approach
  • Nov 13, 2024
  • Applied Sciences
  • John Benedict Lazaro Bernardo + 4 more

This study explores Human Activity Recognition (HAR) using smartphone sensors to address the challenges posed by position-dependent datasets. We propose a position-independent system that leverages data from accelerometers, gyroscopes, linear accelerometers, and gravity sensors collected from smartphones placed either on the chest or in the left/right leg pocket. The performance of traditional machine learning algorithms (Decision Trees (DT), K-Nearest Neighbors (KNN), Random Forest (RF), Support Vector Classifier (SVC), and XGBoost) is compared against deep learning models (Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM), Temporal Convolutional Networks (TCN), and Transformer models) under two sensor configurations. Our findings highlight that the Temporal Convolutional Network (TCN) model consistently outperforms other models, particularly in the four-sensor non-overlapping configuration, achieving the highest accuracy of 97.70%. Deep learning models such as LSTM, GRU, and Transformer also demonstrate strong performance, showcasing their effectiveness in capturing temporal dependencies in HAR tasks. Traditional machine learning models, including RF and XGBoost, provide reasonable performance but do not match the accuracy of deep learning models. Additionally, incorporating data from linear accelerometers and gravity sensors led to slight improvements over using accelerometer and gyroscope data alone. This research enhances the recognition of passenger behaviors for intelligent transportation systems, contributing to more efficient congestion management and emergency response strategies.

  • Research Article
  • Cite Count Icon 1
  • 10.1038/s41598-025-14423-z
TCN-QRNN model for short term energy consumption forecasting with increased accuracy and optimized computational efficiency
  • Aug 5, 2025
  • Scientific Reports
  • Lesia Mochurad + 1 more

In the context of the growing volume and complexity of data, traditional methods of energy consumption forecasting, such as Recurrent Neural Networks (RNN), face computational complexity issues that limit their real-time application. This also complicates the effective management of energy systems. In this work, a new model is proposed that combines the advantages of Temporal Convolutional Networks (TCN) and Quasi-Recurrent Neural Networks (QRNN) for energy consumption forecasting. TCN allows for effective processing of long time series, capturing essential temporal dependencies. Meanwhile, QRNN reduces computational costs through parallelization of operations and an optimized architecture. The effectiveness of the proposed model has been assessed in comparison with traditional methods such as Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks, as well as other convolutional approaches. Experimental results show that the proposed TCN-QRNN model outperforms traditional methods by 40% in accuracy compared to LSTM and by 8% in terms of metrics such as Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE) compared to TCN-LSTM, while reducing data processing time by 30%. Additionally, the model has a significantly smaller number of parameters than LSTM and GRU, making it suitable for environments with limited computational resources. The proposed model ensures a high level of energy consumption forecasting accuracy while significantly reducing processing time, making it promising for use in real-world energy systems.

  • Research Article
  • 10.30871/jaic.v8i2.8704
Ensemble Voting Method for Phonocardiogram Heart Signal Classification Using FFT Features
  • Nov 25, 2024
  • Journal of Applied Informatics and Computing
  • Adisaputra Zidha Noorizki + 2 more

Heart disease is still one of the leading causes of death worldwide, hence the need for effective diagnostic tools. Phonocardiogram (PCG) signals have been explored as a complementary approach to electrocardiogram (ECG) to detect cardiac abnormalities. This research investigates the classification of PCG signals using Fast Fourier Transform (FFT) features and deep learning models, including Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and Temporal Convolutional Network (TCN). Hyperparameter tuning, particularly learning rate adjustment, is applied to optimize the performance of the models. The results show that the GRU and TCN models outperform the LSTM, achieving up to 92% accuracy at a learning rate of 0.0001. Ensemble learning with soft voting was also applied to combine the strengths of each model. Although the ensemble model showed strong performance with 92% accuracy and ROC AUC of 0.9636, it did not provide significant improvement over the base model. This finding highlights the importance of hyperparameter tuning in model optimization, with GRU and TCN showing slightly better performance in the time series classification task. This study concludes that ensemble learning offers stability but does not significantly improve classification accuracy beyond a well-tuned base model.

  • Research Article
  • Cite Count Icon 2
  • 10.3390/jrfm18100551
Deep Learning-Based Hybrid Model with Multi-Head Attention for Multi-Horizon Stock Price Prediction
  • Oct 1, 2025
  • Journal of Risk and Financial Management
  • Rajesh Kumar Ghosh + 3 more

The prediction of stock prices is challenging due to their volatility, irregular patterns, and complex time-series structure. Reliably forecasting stock market data plays a crucial role in minimizing financial risk and optimizing investment strategies. However, traditional models often struggle to capture temporal dependencies and extract relevant features from noisy inputs, which limits their predictive performance. To improve this, we developed an enhanced recursive feature elimination (RFE) method that blends the importance of impurity-based features from random forest and gradient boosting models with Kendall tau correlation analysis, and we applied SHapley Additive exPlanations (SHAP) analysis to externally validate the reliability of the selected features. This approach leads to more consistent and reliable feature selection for short-term stock prediction over 1-, 3-, and 7-day intervals. The proposed deep learning (DL) architecture integrates a temporal convolutional network (TCN) for long-term pattern recognition, a gated recurrent unit (GRU) for sequence capture, and multi-head attention (MHA) for focusing on critical information, thereby achieving superior predictive performance. We evaluate the proposed approach using daily stock price data from three leading companies—HDFC Bank, Tata Consultancy Services (TCS), and Tesla—and two major stock indices: Nifty 50 and S&amp;P 500. The performance of our model is compared against five benchmark models: temporal convolutional network (TCN), long short-term memory (LSTM), GRU, Bidirectional GRU, and a hybrid TCN–GRU model. Our method consistently shows lower error rates and higher predictive accuracy across all datasets, as measured by four commonly used performance metrics.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 19
  • 10.5194/os-19-1561-2023
Short-term prediction of the significant wave height and average wave period based on the variational mode decomposition–temporal convolutional network–long short-term memory (VMD–TCN–LSTM) algorithm
  • Nov 9, 2023
  • Ocean Science
  • Qiyan Ji + 5 more

Abstract. The present work proposes a prediction model of significant wave height (SWH) and average wave period (APD) based on variational mode decomposition (VMD), temporal convolutional networks (TCNs), and long short-term memory (LSTM) networks. The wave sequence features were obtained using VMD technology based on the wave data from the National Data Buoy Center. Then the SWH and APD prediction models were established using TCNs, LSTM, and Bayesian hyperparameter optimization. The VMD–TCN–LSTM model was compared with the VMD–LSTM (without TCN cells) and LSTM (without VMD and TCN cells) models. The VMD–TCN–LSTM model has significant superiority and shows robustness and generality in different buoy prediction experiments. In the 3 h wave forecasts, VMD primarily improved the model performance, while the TCN had less of an influence. In the 12, 24, and 48 h wave forecasts, both VMD and TCNs improved the model performance. The contribution of the TCN to the improvement of the prediction result determination coefficient gradually increased as the forecasting length increased. In the 48 h SWH forecasts, the VMD and TCN improved the determination coefficient by 132.5 % and 36.8 %, respectively. In the 48 h APD forecasts, the VMD and TCN improved the determination coefficient by 119.7 % and 40.9 %, respectively.

  • Research Article
  • Cite Count Icon 20
  • 10.1016/j.ymssp.2023.110240
New machine learning application platform for spatial–temporal thermal error prediction and control with STFGCN for ball screw system
  • Mar 1, 2023
  • Mechanical Systems and Signal Processing
  • Hongquan Gui + 4 more

New machine learning application platform for spatial–temporal thermal error prediction and control with STFGCN for ball screw system

Save Icon
Up Arrow
Open/Close
Notes

Save Important notes in documents

Highlight text to save as a note, or write notes directly

You can also access these Documents in Paperpal, our AI writing tool

Powered by our AI Writing Assistant