Comprehensive Survey of Model Compression and Speed up for Vision Transformers

Feiyang Chen,Xueting Pan,Ziqian Luo,Lisang Zhou,Ying Jiang

doi:10.62836/jitp.v1i1.156

Comprehensive Survey of Model Compression and Speed up for Vision Transformers

Feiyang Chen, Xueting Pan + Show 3 more

https://doi.org/10.62836/jitp.v1i1.156

Copy DOI

Journal: Journal of Information, Technology and Policy	Publication Date: Apr 4, 2024
Citations: 3	License type: CC BY 4.0

#Knowledge Distillation #Low-rank Approximation + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

Vision Transformers (ViT) have marked a paradigm shift in computer vision, outperforming state-of-the-art models across diverse tasks. However, their practical deployment is hampered by high computational and memory demands. This study addresses the challenge by evaluating four primary model compression techniques: quantization, low-rank approximation, knowledge distillation, and pruning. We methodically analyze and compare the efficacy of these techniques and their combinations in optimizing ViTs for resource-constrained environments. Our comprehensive experimental evaluation demonstrates that these methods facilitate a balanced compromise between model accuracy and computational efficiency, paving the way for wider application in edge computing devices.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Journal of Information, Technology and Policy

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.