Abstract

Multivariate time series forecasting has long been a research hotspot because of its wide range of application scenarios. However, the dynamic and multi-pattern nature of spatiotemporal dependencies makes this problem challenging. Most existing methods suffer from two major shortcomings: (1) they ignore local context semantics when modeling temporal dependencies, and (2) they lack the ability to capture spatial dependencies with multiple patterns. To tackle these issues, we propose a novel Transformer-based model for multivariate time series forecasting, called the spatial–temporal convolutional Transformer network (STCTN). STCTN mainly consists of two novel attention mechanisms that respectively model temporal and spatial dependencies. A local-range convolutional attention mechanism simultaneously attends to both global and local temporal context at the sequence level, which addresses the first shortcoming. A group-range convolutional attention mechanism models multiple spatial dependency patterns at the graph level while reducing computation and memory complexity, which addresses the second shortcoming. In addition, continuous positional encoding links the historical observations and the predicted future values in the positional encoding, which further improves forecasting performance. Extensive experiments on six real-world datasets show that STCTN outperforms state-of-the-art methods and is more robust to nonsmooth time series data.
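To make the local-range idea concrete, the sketch below builds queries and keys with causal 1-D convolutions before standard scaled dot-product attention, so each position carries its local temporal context into a global attention computation. This is a minimal illustrative sketch in PyTorch, not the authors' released implementation; the class and parameter names (`LocalRangeConvAttention`, `kernel_size`) are assumptions introduced for exposition.

```python
# Illustrative sketch only: LocalRangeConvAttention and kernel_size are
# hypothetical names, not the paper's released code.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LocalRangeConvAttention(nn.Module):
    """Self-attention whose queries/keys are built with causal 1-D convolutions,
    so every position attends globally while being aware of its local context."""
    def __init__(self, d_model, n_heads, kernel_size=3):
        super().__init__()
        self.n_heads, self.d_head = n_heads, d_model // n_heads
        pad = kernel_size - 1  # causal padding: only past steps feed each Q/K
        self.q_conv = nn.Conv1d(d_model, d_model, kernel_size, padding=pad)
        self.k_conv = nn.Conv1d(d_model, d_model, kernel_size, padding=pad)
        self.v_proj = nn.Linear(d_model, d_model)
        self.out = nn.Linear(d_model, d_model)

    def forward(self, x):                      # x: (batch, seq_len, d_model)
        B, T, D = x.shape
        xc = x.transpose(1, 2)                 # (B, D, T) for Conv1d
        q = self.q_conv(xc)[..., :T].transpose(1, 2)   # trim extra causal padding
        k = self.k_conv(xc)[..., :T].transpose(1, 2)
        v = self.v_proj(x)

        def split(t):                          # (B, n_heads, T, d_head)
            return t.view(B, T, self.n_heads, self.d_head).transpose(1, 2)

        q, k, v = split(q), split(k), split(v)
        scores = q @ k.transpose(-2, -1) / self.d_head ** 0.5
        attn = F.softmax(scores, dim=-1)       # attention stays global over all steps
        out = (attn @ v).transpose(1, 2).reshape(B, T, D)
        return self.out(out)
```

In this reading, the convolutions only change how queries and keys are formed; the attention itself remains global, so local context informs rather than restricts the receptive field.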

Highlights

  • Time series forecasting has a wide range of application scenarios in transportation, finance, medical, and other fields

  • We propose a novel Transformer-based model for multivariate time series forecasting, called the spatial–temporal convolutional Transformer network (STCTN)

  • The obstacles to applying the Transformer to multivariate time series forecasting are that the standard self-attention mechanism operates only at the sequence level and cannot capture spatial dependencies, and that it is weak at capturing temporal dependencies with multiple patterns


Summary

Introduction

Time series forecasting has a wide range of application scenarios in transportation, finance, medical, and other fields. The development of graph neural networks (GNNs) has brought time series forecasting to a new level, and numerous GNN-based methods for spatiotemporal data prediction have been proposed, such as DCRNN [14], STGCN [15], ASTGCN [16], MTGNN [17], STSGCN [18], and StemGNN [19]. In this work, the group-range convolutional attention mechanism uses multi-head attention to learn the latent graph structures among multiple time series, extracting dynamic spatial dependencies with multiple patterns, which addresses the second shortcoming. We design a novel Transformer-based encoder–decoder framework for multivariate time series forecasting that can dynamically model spatiotemporal dependencies. Two novel range convolutional attention mechanisms are proposed to effectively extract dynamic, multi-pattern spatiotemporal dependencies while reducing computational complexity.
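The summary above does not spell out the group-range computation, but one plausible reading is that each node (series) attends to a small set of learned group representatives rather than to every other node. The PyTorch sketch below is assumption-laden: `GroupRangeSpatialAttention`, `n_groups`, and the soft-assignment pooling are hypothetical, introduced only to show how multi-head attention over grouped nodes can yield several latent dependency patterns while cutting the cost of the score matrix from O(N²) to O(N·G).

```python
# Hedged sketch: GroupRangeSpatialAttention, n_groups, and the pooling scheme
# are illustrative assumptions; the paper only states that multi-head attention
# learns latent graph structures over the series while reducing complexity.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GroupRangeSpatialAttention(nn.Module):
    """Attention over the variable (node) axis. Nodes attend to G learned group
    representatives instead of all N nodes, so the score matrix is N x G, and
    each head can learn a different spatial dependency pattern."""
    def __init__(self, d_model, n_heads, n_nodes, n_groups):
        super().__init__()
        self.n_heads, self.d_head = n_heads, d_model // n_heads
        # learned soft assignment of the N nodes to G groups
        self.assign = nn.Parameter(torch.randn(n_nodes, n_groups))
        self.q_proj = nn.Linear(d_model, d_model)
        self.k_proj = nn.Linear(d_model, d_model)
        self.v_proj = nn.Linear(d_model, d_model)
        self.out = nn.Linear(d_model, d_model)

    def forward(self, x):                      # x: (batch, n_nodes, d_model)
        B, N, D = x.shape
        w = F.softmax(self.assign, dim=0)      # (N, G); each group is a convex mix of nodes
        groups = torch.einsum('bnd,ng->bgd', x, w)   # (B, G, D) group representatives

        def split(t):                          # (B, n_heads, len, d_head)
            return t.view(B, -1, self.n_heads, self.d_head).transpose(1, 2)

        q = split(self.q_proj(x))              # (B, H, N, d_head)
        k = split(self.k_proj(groups))         # (B, H, G, d_head)
        v = split(self.v_proj(groups))
        scores = q @ k.transpose(-2, -1) / self.d_head ** 0.5   # (B, H, N, G)
        attn = F.softmax(scores, dim=-1)       # one latent dependency pattern per head
        out = (attn @ v).transpose(1, 2).reshape(B, N, D)
        return self.out(out)
```

The soft grouping here stands in for whatever aggregation the paper actually uses; the point of the sketch is that attending to G ≪ N representatives preserves per-head dependency patterns while shrinking both computation and memory.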

Related Work
Problem Definition
Self-Attention Mechanism
Local-Range Convolutional Attention
Group-Range Convolutional Attention
Continuous Positional Encoding
Spatial–Temporal Encoder
Spatial–Temporal Decoder
Output Module
Baseline Methods
Experimental Settings
Evaluation Metrics
Results and Analysis
