Abstract

In recent years, deep learning has been widely used to tackle problems that classical approaches could not solve. In particular, the autoencoder is a popular unsupervised artificial neural network that learns efficient data representations (encodings) by training the network to ignore features with low information content. Although autoencoders outperform classical techniques in several applications, such as anomaly detection, dimensionality reduction, feature denoising, and missing-value imputation, the literature does not provide a commonly accepted methodology for defining the optimal amount of data needed to train the model. This paper proposes a procedure to determine the optimal training-set size that minimizes the reconstruction error of an autoencoder with a pre-defined structure and hyper-parameters, trained to encode the normal behavior of energy generation systems. The procedure exploits learning curves, a powerful tool for tracking an algorithm's performance as the training-set size varies. The procedure is then applied to three real case studies in which two types of autoencoders are trained to learn the normal behavior of a YANMAR combined heat and power unit, with the aim of detecting incoming anomalies. Finally, the outcomes of the procedure are discussed and, under the constraint of a daily retraining frequency, six weeks of data are identified as the optimal training-set size for both autoencoders.
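The learning-curve idea underlying the procedure, i.e. tracking validation reconstruction error as the training-set size grows and picking the size where the curve plateaus, can be sketched as follows. This is a minimal illustrative sketch only: the tiny `MLPRegressor`-based autoencoder, the synthetic data, and the candidate sizes are assumptions for demonstration, not the paper's actual models, data, or hyper-parameters.

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.metrics import mean_squared_error

# Illustrative synthetic "normal behavior" data (the paper uses real CHP-unit data).
rng = np.random.default_rng(0)
X_train = rng.normal(size=(600, 8))
X_val = rng.normal(size=(200, 8))

# Candidate training-set sizes (in the paper these would be, e.g., weeks of data).
train_sizes = [50, 100, 200, 400, 600]
errors = []

for n in train_sizes:
    # A small MLP trained to reconstruct its own input acts as an autoencoder;
    # the narrow hidden layer is the bottleneck (encoding).
    ae = MLPRegressor(hidden_layer_sizes=(3,), max_iter=500, random_state=0)
    ae.fit(X_train[:n], X_train[:n])
    # Validation reconstruction error for this training-set size.
    errors.append(mean_squared_error(X_val, ae.predict(X_val)))

# The learning curve is (train_sizes, errors); the optimal size is where
# adding more data no longer reduces the validation reconstruction error.
for n, e in zip(train_sizes, errors):
    print(f"n={n:4d}  val MSE={e:.4f}")
```

Plotting `errors` against `train_sizes` gives the learning curve; under a retraining-frequency constraint, one would choose the smallest size past the knee of that curve.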
