Parameter-Efficient Model Adaptation for Vision Transformers

Xuehai He,Xin Eric Wang,Chunyuan Li,Jianwei Yang,Pengchuan Zhang

doi:10.1609/aaai.v37i1.25160

Abstract

In computer vision, it has achieved great transfer learning performance via adapting large-scale pretrained vision models (e.g., vision transformers) to downstream tasks. Common approaches for model adaptation either update all model parameters or leverage linear probes. In this paper, we aim to study parameter-efficient model adaptation strategies for vision transformers on the image classification task. We formulate efficient model adaptation as a subspace training problem and perform a comprehensive benchmarking over different efficient adaptation methods. We conduct an empirical study on each efficient model adaptation method focusing on its performance alongside parameter cost. Furthermore, we propose a parameter-efficient model adaptation framework, which first selects submodules by measuring local intrinsic dimensions and then projects them into subspace for further decomposition via a novel Kronecker Adaptation method. We analyze and compare our method with a diverse set of baseline model adaptation methods (including state-of-the-art methods for pretrained language models). Our method performs the best in terms of the tradeoff between accuracy and parameter efficiency across 20 datasets under the few-shot setting and 7 image classification datasets under the full-shot setting.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parameter-Efficient Model Adaptation for Vision Transformers

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 8

Similar Papers

CPT: Cross-Modal Prefix-Tuning for Speech-To-Text Translation
Yukun Ma ... Trung Hieu Nguyen
-
Yukun Ma, et. al.Yukun Ma ... Trung Hieu Nguyen
23 May 2022
23 May 2022

Keratoconus disease classification with multimodel fusion and vision transformer: a pretrained model approach
Shokufeh Yaraghi ... Toktam Khatibi
BMJ Open Ophthalmology | VOL. 9
Shokufeh Yaraghi, et. al.Shokufeh Yaraghi ... Toktam Khatibi
01 Apr 2024
BMJ Open Ophthalmology | VOL. 9

SCIDA: Self-Correction Integrated Domain Adaptation From Single- to Multi-Label Aerial Images
Tianze Yu ... Z Jane Wang
IEEE Transactions on Geoscience and Remote Sensing | VOL. 60
Tianze Yu, et. al.Tianze Yu ... Z Jane Wang
01 Jan 2021
IEEE Transactions on Geoscience and Remote Sensing | VOL. 60

Acoustic analysis and feature transformation from neutral to whisper for speaker identification within whispered speech audio streams
Xing Fan ... John H.L Hansen
Speech Communication | VOL. 55
Xing Fan, et. al.Xing Fan ... John H.L Hansen
24 Aug 2012
Speech Communication | VOL. 55

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parameter-Efficient Model Adaptation for Vision Transformers

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence