Self-supervised visual representation learning on food images

Andrew W Peng,Fengqing Zhu,Jiangpeng He

doi:10.2352/ei.2023.35.7.image-269

Abstract

Food image classification is the groundwork for image-based dietary assessment, which is the process of monitoring what kinds of food and how much energy is consumed using captured food or eating scene images. Existing deep learning based methods learn the visual representation for food classification based on human annotation of each food image. However, most food images captured from real life are obtained without labels, requiring human annotation to train deep learning based methods. This approach is not feasible for real world deployment due to high costs. To make use of the vast amount of unlabeled images, many existing works focus on unsupervised or self-supervised learning to learn the visual representation directly from unlabeled data. However, none of these existing works focuses on food images, which is more challenging than general objects due to its high inter-class similarity and intra-class variance. In this paper, we focus on two items: the comparison of existing models and the development of an effective self-supervised learning model for food image classification. Specifically, we first compare the performance of existing state-of-the-art self-supervised learning models, including SimSiam, SimCLR, SwAV, BYOL, MoCo, and Rotation Pretext Task on food images. The experiments are conducted on the Food-101 dataset, which contains 101 different classes of foods with 1,000 images in each class. Next, we analyze the unique features of each model and compare their performance on food images to identify the key factors in each model that can help improve the accuracy. Finally, we propose a new model for unsupervised visual representation learning on food images for the classification task.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Self-supervised visual representation learning on food images

Abstract

Talk to us

Similar Papers

More From: Electronic Imaging

Lead the way for us

Journal: Electronic Imaging	Publication Date: Jan 16, 2023
Citations: 1

Similar Papers

Food image classification and image retrieval based on visual features and machine learning
Pengcheng Wei ... Bo Wang
Multimedia Systems | VOL. 28
Pengcheng Wei, et. al.Pengcheng Wei ... Bo Wang
21 Jul 2020
Multimedia Systems | VOL. 28

Conditional synthetic food image generation
Wenjin Fu ... Jiangpeng He
Electronic Imaging | VOL. 35
Wenjin Fu, et. al.Wenjin Fu ... Jiangpeng He
16 Jan 2023
Electronic Imaging | VOL. 35

Incorporating Visual Information in Audio Based Self-Supervised Speaker Recognition
Danwei Cai ... Weiqing Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30
Danwei Cai, et. al.Danwei Cai ... Weiqing Wang
01 Jan 2021
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30

Food Image Classification with Deep Features
Abdulkadir Sengur ... Umit Budak
-
Abdulkadir Sengur, et. al.Abdulkadir Sengur ... Umit Budak
01 Sep 2019
01 Sep 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Self-supervised visual representation learning on food images

Abstract

Talk to us

Similar Papers

More From: Electronic Imaging