Optimal training of integer-valued neural networks with mixed integer programming.

Tómas Thorbjarnarson,Neil Yorke-Smith

doi:10.1371/journal.pone.0261029

Tómas Thorbjarnarson, Neil Yorke-Smith

Open Access

https://doi.org/10.1371/journal.pone.0261029

Copy DOI

Journal: PLOS ONE	Publication Date: Feb 1, 2023
Citations: 3	License type: CC BY 4.0

Affiliation: Delft University of Technology

Abstract

Recent work has shown potential in using Mixed Integer Programming (MIP) solvers to optimize certain aspects of neural networks (NNs). However the intriguing approach of training NNs with MIP solvers is under-explored. State-of-the-art-methods to train NNs are typically gradient-based and require significant data, computation on GPUs, and extensive hyper-parameter tuning. In contrast, training with MIP solvers does not require GPUs or heavy hyper-parameter tuning, but currently cannot handle anything but small amounts of data. This article builds on recent advances that train binarized NNs using MIP solvers. We go beyond current work by formulating new MIP models which improve training efficiency and which can train the important class of integer-valued neural networks (INNs). We provide two novel methods to further the potential significance of using MIP to train NNs. The first method optimizes the number of neurons in the NN while training. This reduces the need for deciding on network architecture before training. The second method addresses the amount of training data which MIP can feasibly handle: we provide a batch training method that dramatically increases the amount of data that MIP solvers can use to train. We thus provide a promising step towards using much more data than before when training NNs using MIP models. Experimental results on two real-world data-limited datasets demonstrate that our approach strongly outperforms the previous state of the art in training NN with MIP, in terms of accuracy, training time and amount of data. Our methodology is proficient at training NNs when minimal training data is available, and at training with minimal memory requirements-which is potentially valuable for deploying to low-memory devices.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimal training of integer-valued neural networks with mixed integer programming.

Abstract

Talk to us

Similar Papers

More From: PLOS ONE

Lead the way for us

Similar Papers

Determining ideal fabric cutting times for apparel manufacturing by using mixed integer programming and a heuristic method
Yi Feng Hung ... Chao Ying Chang
European J. of Industrial Engineering | VOL. 8
Yi Feng Hung, et. al.Yi Feng Hung ... Chao Ying Chang
01 Jan 2014
European J. of Industrial Engineering | VOL. 8

Scheduling software updates for connected cars with limited availability
Carlos E Andrade ... Christopher T Volinsky
Applied Soft Computing | VOL. 82
Carlos E Andrade, et. al.Carlos E Andrade ... Christopher T Volinsky
27 Jun 2019
Applied Soft Computing | VOL. 82

Mixed-integer programming models for optimal constellation scheduling given cloud cover uncertainty
Christopher G Valicka ... Lewis Ntaimo
European Journal of Operational Research | VOL. 275
Christopher G Valicka, et. al.Christopher G Valicka ... Lewis Ntaimo
23 Nov 2018
European Journal of Operational Research | VOL. 275

APDCM 2018 Keynote
Yuji Shinano
-
Yuji ShinanoYuji Shinano
01 May 2018
APDCM 2018 Keynote
Yuji Shinano

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimal training of integer-valued neural networks with mixed integer programming.

Abstract

Talk to us

Similar Papers

More From: PLOS ONE