CR-SAM: Curvature Regularized Sharpness-Aware Minimization

Tao Wu, Tie Luo, Donald C Wunsch II

doi:10.1609/aaai.v38i6.28431

Abstract

The capacity to generalize to future unseen data is one of the most crucial attributes of deep neural networks. Sharpness-Aware Minimization (SAM) aims to enhance generalizability by minimizing the worst-case loss, using one-step gradient ascent as an approximation. However, as training progresses, the non-linearity of the loss landscape increases, rendering one-step gradient ascent less effective. On the other hand, multi-step gradient ascent incurs a higher training cost. In this paper, we introduce a normalized Hessian trace to accurately measure the curvature of the loss landscape on both training and test sets. In particular, to counter excessive non-linearity of the loss landscape, we propose Curvature Regularized SAM (CR-SAM), which integrates the normalized Hessian trace as a SAM regularizer. Additionally, we present an efficient way to compute the trace via finite differences with parallelism. Our theoretical analysis based on PAC-Bayes bounds establishes the regularizer's efficacy in reducing generalization error. Empirical evaluation on CIFAR and ImageNet datasets shows that CR-SAM consistently enhances classification performance for ResNet and Vision Transformer (ViT) models across various datasets. Our code is available at https://github.com/TrustAIoT/CR-SAM.
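
To make the two ingredients named in the abstract concrete, here is a minimal sketch in plain Python. It is not the authors' implementation (see their repository for that), and it does not include the normalization that CR-SAM applies to the trace. It assumes a generic scalar loss over a list of weights, shows the one-step gradient-ascent perturbation that SAM uses, and estimates a Hessian trace with central finite differences along random ±1 probe directions (a Hutchinson-style estimator, swapped in here for illustration). The function names, the toy quadratic loss, and the parameters `rho`, `eps`, and `num_probes` are all hypothetical.

```python
import random


def sam_perturbation(grad, rho):
    """One-step gradient ascent: a step of length rho along the normalized gradient."""
    norm = sum(g * g for g in grad) ** 0.5
    if norm == 0.0:
        return [0.0 for _ in grad]
    return [rho * g / norm for g in grad]


def directional_curvature(loss, w, v, eps=1e-3):
    """Central finite difference for v^T H v, using three loss evaluations."""
    w_plus = [wi + eps * vi for wi, vi in zip(w, v)]
    w_minus = [wi - eps * vi for wi, vi in zip(w, v)]
    # The two perturbed evaluations are independent of each other,
    # so they could be computed in parallel.
    return (loss(w_plus) + loss(w_minus) - 2.0 * loss(w)) / (eps * eps)


def hessian_trace_estimate(loss, w, num_probes=8, eps=1e-3, seed=0):
    """Average v^T H v over random +/-1 probe vectors (assumed estimator)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_probes):
        v = [rng.choice((-1.0, 1.0)) for _ in w]
        total += directional_curvature(loss, w, v, eps)
    return total / num_probes


def toy_loss(w):
    """Hypothetical quadratic loss whose Hessian is diag(1, 3), so its trace is 4."""
    return 0.5 * (1.0 * w[0] ** 2 + 3.0 * w[1] ** 2)


if __name__ == "__main__":
    w = [1.0, -2.0]
    print(sam_perturbation([3.0, 4.0], rho=0.05))
    print(hessian_trace_estimate(toy_loss, w))
```

On the toy quadratic the estimate should come out close to 4, because the central difference is exact for quadratics up to rounding error. For a real network the loss is not quadratic, so `eps` trades truncation error against floating-point noise.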

More From: Proceedings of the AAAI Conference on Artificial Intelligence
