Sparse deep neural networks for modeling aluminum electrolysis dynamics

Erlend Torje Berg Lundby, Adil Rasheed, Jan Tommy Gravdahl, Ivar Johan Halvorsen

doi:10.1016/j.asoc.2023.109989

Open Access

https://doi.org/10.1016/j.asoc.2023.109989

Abstract

Deep neural networks have become very popular for modeling complex nonlinear processes because they can fit arbitrary nonlinear functions from data with minimal expert intervention. However, they are almost always overparameterized, and their internal complexity makes them hard to interpret. Furthermore, the optimization that finds the model parameters can be unstable because it can get stuck in local minima. In this work, we demonstrate the value of sparse regularization techniques for significantly reducing model complexity. We demonstrate this for an aluminum extraction process, which is a highly nonlinear system with many interrelated subprocesses. We trained a densely connected deep neural network to model the process and then compared the effects of sparsity-promoting ℓ1 regularization on generalizability, interpretability, and training stability. We found that the regularization significantly reduces model complexity compared to a corresponding dense neural network. We argue that this makes the model more interpretable, and show that training an ensemble of sparse neural networks with different parameter initializations often converges to similar model structures with similar learned input features. Furthermore, the empirical study shows that the resulting sparse models generalize better from small training sets than their dense counterparts.
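To make the sparsity-promoting effect of ℓ1 regularization concrete, here is a minimal sketch in plain Python. It is not the authors' method: it swaps the deep network for a three-input linear model and uses proximal gradient descent (soft-thresholding) instead of whatever optimizer the paper used. The toy data, the function names `soft_threshold` and `fit_l1_linear`, and the values of `lam` and `step` are all assumptions chosen for illustration.

```python
from itertools import product


def soft_threshold(value, threshold):
    """Proximal operator of the l1 norm: shrink toward zero, set small values to exactly 0."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def fit_l1_linear(inputs, targets, lam, step=0.5, iterations=200, init=None):
    """Minimize (1/2n) * sum((x.w - y)^2) + lam * sum(|w|) by proximal gradient descent."""
    n_samples = len(inputs)
    n_features = len(inputs[0])
    weights = list(init) if init is not None else [0.0] * n_features
    for _ in range(iterations):
        # Residuals of the current linear prediction.
        residuals = [
            sum(w * x for w, x in zip(weights, row)) - y
            for row, y in zip(inputs, targets)
        ]
        # Gradient of the smooth (squared-error) part only.
        gradient = [
            sum(r * row[j] for r, row in zip(residuals, inputs)) / n_samples
            for j in range(n_features)
        ]
        # Gradient step followed by soft-thresholding, which handles the l1 term.
        weights = [
            soft_threshold(w - step * g, step * lam)
            for w, g in zip(weights, gradient)
        ]
    return weights


# Hypothetical toy data: three inputs on a full +/-1 grid; only input 0 drives the target.
inputs = [list(row) for row in product([-1.0, 1.0], repeat=3)]
targets = [2.0 * row[0] for row in inputs]

weights = fit_l1_linear(inputs, targets, lam=0.1, init=[0.5, -0.3, 0.4])
active_inputs = [j for j, w in enumerate(weights) if w != 0.0]
print(weights, active_inputs)
```

On this toy problem the weights on the two irrelevant inputs are driven to exactly zero, and the remaining weight settles near 2 − `lam` = 1.9, so only input 0 stays active. That is the small-scale analogue of the abstract's claim: the penalty removes parameters the data do not support, and the surviving connections show which inputs the model relies on.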

Journal: Applied Soft Computing	Publication Date: Jan 9, 2023
Citations: 10	License type: cc-by

Similar Papers

Abstract P05: EpistoNet: An ensemble of deep convolutional neural networks using mixture of discriminative experts for detecting COVID-19 on chest X-ray images
Seyed Ziae Mousavi Mojab ... Seyedmohammad Shams
Clinical Cancer Research | VOL. 27
12 Mar 2021

FTBME: feature transferring based multi-model ensemble
A Yongquan Yang ... E Zhongxi Zheng
Multimedia Tools and Applications | VOL. 79
12 Mar 2020

Intelligent tuberculosis activity assessment system based on an ensemble of neural networks
Victor Sineglazov ... Nikolai Linnik
Computers in Biology and Medicine | VOL. 147
28 Jun 2022

Innovations in Neural Information Paradigms and Applications
Lakhmi C Jain
01 Jan 2009
