Efficient visual transformer transferring from neural ODE perspective

Hao Niu,Jianyong Wang,Fengming Luo,Yi Zhang,Bo Yuan

doi:10.1049/ell2.70015

Hao Niu, Jianyong Wang + Show 3 more

Open Access

https://doi.org/10.1049/ell2.70015

Copy DOI

Export

Save

Cite

Journal: Electronics Letters	Publication Date: Sep 1, 2024
License type: CC BY-NC-ND 4.0

Abstract
Full-Text
Similar Papers

Abstract

Listen

AbstractRecently, the Visual Image Transformer (ViT) has revolutionized various domains in computer vision. The transfer of pre‐trained ViT models on large‐scale datasets has proven to be a promising method for downstream tasks. However, traditional transfer methods introduce numerous additional parameters in transformer blocks, posing new challenges in learning downstream tasks. This article proposes an efficient transfer method from the perspective of neural Ordinary Differential Equations (ODEs) to address this issue. On the one hand, the residual connections in the transformer layers can be interpreted as the numerical integration of differential equations. Therefore, the transformer block can be described as two explicit Euler method equations. By dynamically learning the step size in the explicit Euler equation, a highly lightweight method for transferring the transformer block is obtained. On the other hand, a new learnable neural memory ODE block is proposed by taking inspiration from the self‐inhibition mechanism in neural systems. It increases the diversity of dynamical behaviours of the neurons to transfer the head block efficiently and enhances non‐linearity simultaneously. Experimental results in image classification demonstrate that the proposed approach can effectively transfer ViT models and outperform state‐of‐the‐art methods.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Efficient visual transformer transferring from neural ODE perspective

Abstract

Published Version

Talk to us

Similar Papers

More From: Electronics Letters

Lead the way for us

Similar Papers

The advance of neural ordinary differential ordinary differential equations
Haoxuan Li
Applied and Computational Engineering | VOL. 6
Haoxuan LiHaoxuan Li
14 Jun 2023
Applied and Computational Engineering | VOL. 6

Accelerating Neural ODEs Using Model Order Reduction.
Mikko Lehtimäki ... Lassi Paunonen
IEEE Transactions on Neural Networks and Learning Systems | VOL. 35
Mikko Lehtimäki, et. al.Mikko Lehtimäki ... Lassi Paunonen
01 Jan 2024
IEEE Transactions on Neural Networks and Learning Systems | VOL. 35

Neural Ordinary Differential Equation based Recurrent Neural Network Model
Mansura Habiba ... Barak A Pearlmutter
-
Mansura Habiba, et. al.Mansura Habiba ... Barak A Pearlmutter
01 Jun 2020
01 Jun 2020

The Essential Tools of Scientific Machine Learning (Scientific ML)
Christopher Rackauckas
-
Christopher RackauckasChristopher Rackauckas
20 Aug 2019
20 Aug 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Efficient visual transformer transferring from neural ODE perspective

Abstract

Published Version

Talk to us

Similar Papers

More From: Electronics Letters