Abstract
This paper presents a feature construction approach called Statistical Feature Construction (SFC) for time series prediction. Creation of new features is based on statistical characteristics of the analyzed data series. First, the initial data are transformed into an array of short pseudo-stationary windows. For each window, a statistical model is created, and the characteristics of these models are later used as additional features for a single window or as time-dependent features for the entire time series. To demonstrate the effect of SFC, five plasma physics and six oceanographic time series were analyzed. For each window, unknown distribution parameters were estimated with the method of moving separation of finite normal mixtures. The first four statistical moments of these mixtures, for both the initial data and the increments, were used as additional data features. Multi-layer recurrent neural networks were trained to create short- and medium-term forecasts with a single window as input data; the additional features were used to initialize the hidden state of the recurrent layers. A hyperparameter grid search was performed to compare fully optimized neural networks on the original and enriched data. A significant decrease in the RMSE metric was observed, with a median of 11.4%, and RMSE did not increase for any of the analyzed time series. The experimental results show that SFC can be a valuable method for improving forecasting accuracy.
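The feature-construction step described above can be illustrated with a minimal sketch: split a series into short windows and attach the first four statistical moments of each window as extra features. This simplified version uses plain sample moments rather than the paper's moving separation of finite normal mixtures, and the function name is hypothetical:

```python
import numpy as np

def window_moment_features(series, window):
    """Split a 1-D series into non-overlapping windows and, for each
    window, compute the first four sample moments (mean, variance,
    skewness, excess kurtosis) to use as additional features."""
    n = len(series) // window
    w = np.asarray(series[: n * window], dtype=float).reshape(n, window)
    mu = w.mean(axis=1, keepdims=True)
    c = w - mu                          # centered values per window
    m2 = (c ** 2).mean(axis=1)          # 2nd central moment (variance)
    m3 = (c ** 3).mean(axis=1)          # 3rd central moment
    m4 = (c ** 4).mean(axis=1)          # 4th central moment
    skew = m3 / np.power(m2, 1.5)       # standardized skewness
    kurt = m4 / m2 ** 2 - 3.0           # excess kurtosis
    feats = np.column_stack([mu.ravel(), m2, skew, kurt])
    return w, feats                     # windows + per-window features
```

In a setup like the one in the abstract, `w` would serve as the recurrent network's input windows, while each row of `feats` could initialize the hidden state of the recurrent layers for the corresponding window.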
Highlights
Forecasting of real-world processes can be limited by the amount of information that can be reasonably collected
The choice of LSTM recurrent layers provided better results than the use of Gated Recurrent Unit (GRU) and plain RNN layers
The paper presents a statistical approach to data modeling and feature construction with applications for two different sets of data
Summary
Forecasting of real-world processes can be limited by the amount of information that can be reasonably collected. These conditions motivate research into probability mixture models for the distributions of the observed processes [1]. A wide class of distributions of the form H(x) = EP[F(x, y)] is usually chosen as the base family [2,3]. Here EP denotes the mathematical expectation with respect to some probability measure P, which defines the mixing distribution; it is usually determined through analysis of the behavior of external factors. F(x, y) is a distribution function with a random vector of parameters y and is called the kernel distribution.
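For a discrete mixing distribution P, the expectation EP[F(x, y)] reduces to a weighted sum of kernel distribution functions. A minimal sketch for the finite normal mixture case, where the kernel is the normal CDF and y = (mean, standard deviation), might look as follows (function names are illustrative):

```python
import math

def normal_cdf(x, mean, std):
    """Normal distribution function Phi((x - mean) / std),
    computed via the error function."""
    return 0.5 * (1.0 + math.erf((x - mean) / (std * math.sqrt(2.0))))

def mixture_cdf(x, weights, means, stds):
    """Finite normal mixture H(x) = sum_k p_k * Phi((x - a_k) / s_k):
    the expectation E_P[F(x, y)] with a discrete mixing measure P
    placing weight p_k on kernel parameters y_k = (a_k, s_k)."""
    return sum(p * normal_cdf(x, a, s)
               for p, a, s in zip(weights, means, stds))
```

For example, a symmetric two-component mixture with weights (0.5, 0.5), means (-1, 1), and unit standard deviations satisfies H(0) = 0.5 by symmetry.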