Abstract

People with blindness or low vision utilize mobile assistive tools for various applications such as object recognition, text recognition, etc. Most of the available applications are focused on recognizing generic objects. And they have not addressed the recognition of food dishes and fruit varieties. In this paper, we propose a smartphone-based system for recognizing the food dishes as well as fruits for children with visual impairments. The Smartphone application utilizes a trained deep CNN model for recognizing the food item from the real-time images. Furthermore, we develop a new deep convolutional neural network (CNN) model for food recognition using the fusion of two CNN architectures. The new deep CNN model is developed using the ensemble learning approach. The deep CNN food recognition model is trained on a customized food recognition dataset.The customized food recognition dataset consists of 29 varieties of food dishes and fruits. Moreover, we analyze the performance of multiple state of art deep CNN models for food recognition using the transfer learning approach. The ensemble model performed better than state of art CNN models and achieved a food recognition accuracy of 95.55 % in the customized food dataset. In addition to that, the proposed deep CNN model is evaluated in two publicly available food datasets to display its efficacy for food recognition tasks.

Highlights

  • The data provided by WHO [1] describes that globally there exist about 285 million visually impaired individuals

  • A laptop with Intel Core i7 -6700 CPU @ 2.60GHZ processor, 24 GB RAM, and 6 GB Nvidia GTX-1060 GPU was used for training the deep convolutional neural network (CNN) models

  • We develop a smartphone-based food recognition application for children with visual impairments

Read more

Summary

Introduction

The data provided by WHO [1] describes that globally there exist about 285 million visually impaired individuals. Various assistive object recognition systems have been proposed for people with visual impairments [6, 17, 26, 32]. Computer vision technology adopts various image processing and machine learning algorithms to recognize the objects from imageries. Tag-based systems [21, 27] utilize visual markers or cues attached to the objects for recognizing them. Non- tag-based systems [7, 9, 16] do not utilize any visual marker or barcodes. Instead, they process the raw imageries and apply various image feature detection algorithms and machine learning algorithms to recognize the objects. Among existing object recognition systems, only a few works are focused on food recognition for people with visual impairments

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call