Smartformer: An intelligent transformer compression framework for time-series modeling

Xiaojian Wang, Yinan Wang, Jin Yang, Ying Chen

doi:10.1080/24725854.2024.2376645

Abstract

Transformers, among the most advanced deep neural networks (DNNs), have achieved outstanding performance in time-series data analysis. However, these models usually require a large number of parameters. Over-parameterization not only creates storage challenges in resource-limited settings but also inevitably leads to overfitting. Although prior work has introduced several ways to reduce the parameter count of Transformers, none has addressed over-parameterization while concurrently achieving the following three objectives: preserving the model architecture, maintaining model performance, and reducing model complexity (number of parameters). In this study, we propose an intelligent model compression framework, Smartformer, which incorporates reinforcement learning and CP-decomposition techniques to satisfy these three objectives. In the experiments, we apply Smartformer and five baseline methods to compress two existing time-series Transformer models. The results demonstrate that Smartformer is the only method that consistently produces compressed models satisfying all three objectives across the various scenarios. In particular, Smartformer mitigates overfitting and thus improves the accuracy of the existing time-series models in all scenarios.
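
The abstract does not describe how CP decomposition is applied inside Smartformer, so the following is only a generic sketch of why a CP (CANDECOMP/PARAFAC) factorization reduces the number of stored parameters. The tensor shape, the rank, and all function names are hypothetical choices made for illustration and are not taken from the paper.

```python
from math import prod


def dense_param_count(shape):
    """Number of entries in a dense tensor with the given shape."""
    return prod(shape)


def cp_param_count(shape, rank):
    """Number of entries stored by a rank-`rank` CP decomposition.

    CP keeps one factor matrix of size (dim x rank) per tensor mode.
    """
    return rank * sum(shape)


def cp_reconstruct(factors):
    """Rebuild a 3-way tensor (nested lists) from three CP factor matrices.

    factors[m][i][r] is entry (i, r) of the factor matrix for mode m.
    """
    a, b, c = factors
    rank = len(a[0])
    return [
        [
            [sum(a[i][r] * b[j][r] * c[k][r] for r in range(rank))
             for k in range(len(c))]
            for j in range(len(b))
        ]
        for i in range(len(a))
    ]


# Assumed example: a 512 x 512 weight matrix viewed as a 64 x 64 x 64 tensor.
shape = (64, 64, 64)
rank = 32  # hypothetical rank; a framework would have to choose this value

dense = dense_param_count(shape)
compact = cp_param_count(shape, rank)
print(f"dense: {dense}, CP factors: {compact}, ratio: {compact / dense:.4f}")
```

In this sketch the rank controls the trade-off between parameter count and how closely the factors can approximate the original tensor. How Smartformer actually selects ranks is not stated in the abstract.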


Journal: IISE Transactions
Publication Date: Jul 18, 2024
License type: cc-by

Similar Papers

Efficient statistical significance approximation for local similarity analysis of high-throughput time series data
Li C Xia ... Jed A Fuhrman
Bioinformatics | VOL. 29
23 Nov 2012

Deep Neural Network Algorithm Feedback Model with Behavioral Intelligence and Forecast Accuracy
Taikyeong Jeong
Symmetry | VOL. 12
07 Sep 2020

Triplet Permutation Method for Deep Learning of Single-Shot Person Re-Identification
M.J Gomez-Silva ... A De La Escalera
01 Jan 2019

Countering Acoustic Adversarial Attacks in Microphone-equipped Smart Home Devices
Sourav Bhattacharya ... Stylianos I Venieris
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies | VOL. 4
15 Jun 2020
