Autocleandeepfood: auto-cleaning and data balancing transfer learning for regional gastronomy food computing

Nauman Ullah Gilal, Marco Agus, Marwa Qaraqe, Jens Schneider

doi:10.1007/s00371-024-03560-7

Abstract

Food computing has emerged as a promising research field, employing artificial intelligence, deep learning, and data science methodologies to enhance various stages of food production pipelines. To this end, the food computing community has compiled a variety of data sets and developed various deep-learning architectures to perform automatic classification. However, automated food classification presents a significant challenge, particularly when it comes to local and regional cuisines, which are often underrepresented in available public-domain data sets. Obtaining high-quality, well-labeled, and well-balanced real-world images is difficult, since manual data curation is time-consuming and requires significant human effort. In contrast, the web is a potentially unlimited source of food data, but tapping into this resource carries a substantial risk of collecting corrupted and wrongly labeled images. In addition, the uneven distribution among food categories may lead to data imbalance problems. All these issues make it challenging to create clean food data sets from web data. To address these issues, we present AutoCleanDeepFood, a novel end-to-end food computing framework for regional gastronomy that contains the following components: (i) a fully automated pre-processing pipeline for creating custom data sets for a specific regional gastronomy, (ii) a transfer learning-based training paradigm to filter out noisy labels through loss ranking, incorporating a Russian Roulette probabilistic approach to mitigate data imbalance problems, and (iii) a method for deploying the resulting model on smartphones for real-time inference. We assess the performance of our framework on a real-world noisy public-domain data set, ETH Food-101, and two novel web-collected data sets, MENA-150 and Pizza-Styles. We demonstrate the filtering capabilities of our proposed method through embedding visualization of the feature space using the t-SNE dimensionality reduction scheme. Our filtering scheme is efficient and improves accuracy in all cases, by 0.96%, 0.71%, and 1.29% on MENA-150, ETH Food-101, and Pizza-Styles, respectively.
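The abstract names loss ranking and a Russian Roulette scheme but does not spell out either, so the following is only a minimal sketch of how such a step might look, not the authors' implementation. It assumes per-sample losses are already available from a fine-tuned model, that the highest-loss fraction of each class is treated as suspected label noise, and that each sample of a class survives with probability min(1, n_min / n_class). The function names, the tuple layout, and the `drop_fraction` parameter are hypothetical.

```python
import random
from collections import defaultdict


def filter_by_loss_rank(samples, drop_fraction=0.1):
    """Drop the highest-loss fraction of each class as suspected label noise.

    `samples` is a list of (sample_id, label, loss) tuples. `drop_fraction`
    is a hypothetical hyperparameter, not a value taken from the paper.
    """
    by_class = defaultdict(list)
    for sample in samples:
        by_class[sample[1]].append(sample)

    kept = []
    for group in by_class.values():
        group.sort(key=lambda s: s[2])  # ascending loss: likely-clean samples first
        n_drop = int(len(group) * drop_fraction)
        kept.extend(group[: len(group) - n_drop])
    return kept


def russian_roulette_balance(samples, rng):
    """Keep each sample with probability min(1, n_min / n_class).

    Assumed balancing rule: samples of over-represented classes are more
    likely to be discarded, so every class has the same expected size
    (that of the smallest class) after the draw.
    """
    if not samples:
        return []
    counts = defaultdict(int)
    for _, label, _ in samples:
        counts[label] += 1
    n_min = min(counts.values())
    # rng.random() lies in [0, 1), so the smallest class is always kept whole.
    return [s for s in samples if rng.random() < n_min / counts[s[1]]]
```

In use, one would call `filter_by_loss_rank` on the losses from a training epoch and pass the survivors to `russian_roulette_balance` with a seeded `random.Random` so that the draw is reproducible. Ranking within each class rather than globally is a design choice of this sketch: it avoids discarding an entire hard class just because its losses are uniformly higher.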

Journal: The Visual Computer
Publication Date: Jul 9, 2024
License type: CC BY 4.0

