Energy-aware neural architecture selection and hyperparameter optimization

Nathan C Frey,Simon Axelrod,Vijay Gadepally,Rafael Gomez-Bombarelli,David Bestor,Michael Jones,Dan Zhao,Siddharth Samsi

doi:10.1109/ipdpsw55747.2022.00125

Abstract

Artificial Intelligence (AI) and Deep Learning in particular have increasing computational requirements, with a corresponding increase in energy consumption. There is a tremendous opportunity to reduce the computational cost and environmental impact of deep learning by accelerating neural network architecture search and hyperparameter optimization, as well as explicitly designing neural architectures that optimize for both energy efficiency and performance. Here, we introduce a framework called training performance estimation (TPE), which builds upon existing techniques for training speed estimation in order to monitor energy consumption and rank model performance-without training models to convergence-saving up to 90% of time and energy of the full training budget. We benchmark TPE in the computationally intensive, well-studied domain of computer vision and in the emerging field of graph neural networks for machine-learned inter-atomic potentials, an important domain for scientific discovery with heavy computational demands. We propose variants of early stopping that generalize this common regularization technique to account for energy costs and study the energy costs of deploying increasingly complex, knowledge-informed architectures for AI-accelerated molecular dynamics and image classification. Our work enables immediate, significant energy savings across the entire pipeline of model development and deployment and suggests new research directions for energy-aware, knowledge-informed model architecture development.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Energy-aware neural architecture selection and hyperparameter optimization

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Efficient and lightweight convolutional neural network architecture search methods for object classification
Chuen-Horng Lin ... Yung-Kuan Chan
Pattern Recognition | VOL. 156
Chuen-Horng Lin, et. al.Chuen-Horng Lin ... Yung-Kuan Chan
06 Jul 2024
Pattern Recognition | VOL. 156

TA-DARTS: Temperature Annealing of Discrete Operator Distribution for Effective Differential Architecture Search
Jiyong Shin ... Dae-Ki Kang
Applied Sciences | VOL. 13
Jiyong Shin, et. al.Jiyong Shin ... Dae-Ki Kang
08 Sep 2023
Applied Sciences | VOL. 13

A technical view on neural architecture search
Yi-Qi Hu ... Yang Yu
International Journal of Machine Learning and Cybernetics | VOL. 11
Yi-Qi Hu, et. al.Yi-Qi Hu ... Yang Yu
14 Feb 2020
International Journal of Machine Learning and Cybernetics | VOL. 11

AutoML: A survey of the state-of-the-art
Xin He ... Xiaowen Chu
Knowledge-Based Systems | VOL. 212
Xin He, et. al.Xin He ... Xiaowen Chu
24 Nov 2020
Knowledge-Based Systems | VOL. 212

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Energy-aware neural architecture selection and hyperparameter optimization

Abstract

Talk to us

Similar Papers