Abstract

We propose a simple scheme for merging two neural networks, trained from different starting initializations, into a single network of the same size as each original. We do this by carefully selecting channels from each input network. Our procedure can serve as a finalization step after one trains from multiple starting seeds to avoid an unlucky one. We also show that training two networks and then merging them leads to better performance than training a single network for an extended period of time.
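
The abstract does not spell out the channel-selection criterion, so the following is only a minimal sketch of one plausible reading: two equally sized convolutional layers are merged by scoring every output channel with a placeholder criterion (here, the L1 norm of its filter) and keeping the top-scoring half, so the merged layer matches the original width. The function name, the scoring rule, and the layer pairing are illustrative assumptions, not the paper's method.

```python
import torch
import torch.nn as nn


def merge_conv_layers(conv_a: nn.Conv2d, conv_b: nn.Conv2d) -> nn.Conv2d:
    """Sketch: merge two same-shaped conv layers by keeping the
    top-scoring output channels across both. The scoring rule
    (L1 filter norm) is an assumption; the paper's actual
    selection criterion may differ."""
    assert conv_a.weight.shape == conv_b.weight.shape, "layers must match"
    n_out = conv_a.out_channels

    # Score each candidate output channel (assumed criterion: L1 norm
    # of the channel's filter weights).
    scores_a = conv_a.weight.detach().abs().flatten(1).sum(dim=1)
    scores_b = conv_b.weight.detach().abs().flatten(1).sum(dim=1)

    # Rank all 2 * n_out candidates and keep the best n_out, so the
    # merged layer has the same size as each original.
    keep = torch.topk(torch.cat([scores_a, scores_b]), n_out).indices

    merged = nn.Conv2d(
        conv_a.in_channels, n_out, conv_a.kernel_size,
        stride=conv_a.stride, padding=conv_a.padding,
        bias=conv_a.bias is not None,
    )
    with torch.no_grad():
        all_w = torch.cat([conv_a.weight, conv_b.weight], dim=0)
        merged.weight.copy_(all_w[keep])
        if conv_a.bias is not None:
            merged.bias.copy_(torch.cat([conv_a.bias, conv_b.bias])[keep])
    return merged
```

In practice, channels drawn from two independently trained networks are not guaranteed to be mutually compatible in downstream layers, so a per-layer selection like this would presumably be followed by fine-tuning of the merged network.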
