Patch-Aware Sample Selection for Efficient Masked Image Modeling

Zhengyang Zhuge,Yongjun Bao,Yong Li,Peisong Wang,Jian Cheng,Jiaxing Wang

doi:10.1609/aaai.v38i15.29671

Abstract

Nowadays sample selection is drawing increasing attention. By extracting and training only on the most informative subset, sample selection can effectively reduce the training cost. Although sample selection is effective in conventional supervised learning, applying it to Masked Image Modeling (MIM) still poses challenges due to the gap between sample-level selection and patch-level pre-training. In this paper, we inspect the sample selection in MIM pre-training and find the basic selection suffers from performance degradation. We attribute this degradation primarily to 2 factors: the random mask strategy and the simple averaging function. We then propose Patch-Aware Sample Selection (PASS), including a low-cost Dynamic Trained Mask Predictor (DTMP) and Weighted Selection Score (WSS). DTMP consistently masks the informative patches in samples, ensuring a relatively accurate representation of selection score. WSS enhances the selection score using patch-level disparity. Extensive experiments show the effectiveness of PASS in selecting the most informative subset and accelerating pretraining. PASS exhibits superior performance across various datasets, MIM methods, and downstream tasks. Particularly, PASS improves MAE by 0.7% on ImageNet-1K while utilizing only 37% data budget and achieves ~1.7x speedup.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Patch-Aware Sample Selection for Efficient Masked Image Modeling

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Calibration set reduction by the selection of a subset containing the best fitting samples showing optimally predictive ability
Jan P.M Andries ... Yvan Vander Heyden
Talanta | VOL. 266
Jan P.M Andries, et. al.Jan P.M Andries ... Yvan Vander Heyden
13 Jul 2023
Talanta | VOL. 266

Rethinking Minimal Sufficient Representation in Contrastive Learning
Haoqing Wang ... Zhihong Deng
-
Haoqing Wang, et. al.Haoqing Wang ... Zhihong Deng
01 Jun 2022
01 Jun 2022

Extreme Model Compression for On-device Natural Language Understanding
...
-
, et. al. ...
25 Nov 2020
25 Nov 2020

Extreme Model Compression for On-device Natural Language Understanding
Kanthashree Mysore Sathyendra ... Leah Nicolich-Henkin
-
Kanthashree Mysore Sathyendra, et. al.Kanthashree Mysore Sathyendra ... Leah Nicolich-Henkin
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Patch-Aware Sample Selection for Efficient Masked Image Modeling

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence