VTnet+Handcrafted based approach for food cuisines classification

Rahul Nijhawan,Garima Sinha,Ashita Batra,Manoj Kumar,Himanshu Sharma

doi:10.1007/s11042-023-15800-4

Abstract

In this paper, we propose a novel hybrid transformer architecture for food cuisine detection and classification. The work carried out within this paper develops a combination of Vision Transformer ensemble architecture with hand-crafted features, thereby making a hybrid Vision Transformer food recognition system. Recently, Vision transformers have been introduced as an alternative means of classification to convolutional neural networks. It performs pattern detection and classification without convolutions and interprets an image as a sequence of patches. The combination of Vision Transformer and hand-crafted features like GIST, HoG (Histogram of Oriented Gradients), and LBP (Local Binary Pattern) were employed on the dataset. The dataset was specifically created (for this work) from the public logging system. It consisted of 13 food categories with 400 images of Indian food items like Ghevar, Idli, Dosa, and much more. It helped to capture a variety of images from every domain and culture. This work made use of the common and readily available food items, which can further be increased by adding on the specialties (dishes) from different regions. Various experiments were performed on CNN with various classifiers like Random forest, and SVM. Further, we compared our proposed approach with several ensembles of CNN architectures. The experiments proved that our proposed approach outperformed the state-of-the-art ensemble CNN architectures for detecting food cuisines. The proposed hybrid approach achieved an accuracy of 94.63%, sensitivity 84.42%, specificity 95.23%, and kappa coefficient 0.93, which was the best amongst all approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Multimedia Tools and Applications	Publication Date: Jun 24, 2023
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

VTnet+Handcrafted based approach for food cuisines classification

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Similar Papers

Automated classification of nasal polyps in endoscopy video-frames using handcrafted and CNN features
Betul Ay ... Galip Aydin
Computers in Biology and Medicine | VOL. 147
Betul Ay, et. al.Betul Ay ... Galip Aydin
13 Jun 2022
Computers in Biology and Medicine | VOL. 147

Hybrid Learning of Hand-Crafted and Deep-Activated Features Using Particle Swarm Optimization and Optimized Support Vector Machine for Tuberculosis Screening
Khin Yadanar Win ... Kazuhiko Hamamoto
Applied Sciences | VOL. 10
Khin Yadanar Win, et. al.Khin Yadanar Win ... Kazuhiko Hamamoto
20 Aug 2020
Applied Sciences | VOL. 10

Combining Deep and Handcrafted Image Features for Presentation Attack Detection in Face Recognition Systems Using Visible-Light Camera Sensors.
Dat Tien Nguyen ... Kang Ryoung Park
Sensors | VOL. 18
Dat Tien Nguyen, et. al.Dat Tien Nguyen ... Kang Ryoung Park
26 Feb 2018
Sensors | VOL. 18

Generally Boosting Few-Shot Learning with HandCrafted Features
Yi Zhang ... Fengtao Zhou
-
Yi Zhang, et. al.Yi Zhang ... Fengtao Zhou
17 Oct 2021
17 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

VTnet+Handcrafted based approach for food cuisines classification

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications