Abstract
An objective dietary assessment system can help users understand their dietary behavior and enable targeted interventions to address underlying health problems. To accurately quantify dietary intake, measurement of the portion size or food volume is required. For volume estimation, most previous studies focused on model-based or stereo-based approaches, which rely on manual intervention or require users to capture multiple frames from different viewing angles, which can be tedious. In this paper, a view synthesis approach based on deep learning is proposed to reconstruct 3D point clouds of food items and estimate the volume from a single depth image. A dedicated neural network is designed to take a depth image from one viewing angle and predict the depth image captured from the corresponding opposite viewing angle. The whole 3D point cloud map is then reconstructed by fusing the initial data points with the synthesized points of the object items through the proposed point cloud completion and Iterative Closest Point (ICP) algorithms. Furthermore, a database of depth images of food items captured from different viewing angles is constructed with image rendering and used to validate the proposed neural network. The methodology is then evaluated by comparing the volume estimated from the synthesized 3D point cloud with the ground truth volume of the object items.
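The first step of the pipeline described above, turning a single depth image into a 3D point cloud, follows the standard pinhole back-projection model. A minimal sketch is given below; the intrinsics (`fx`, `fy`, `cx`, `cy`) and the flat toy depth map are illustrative assumptions, not values from the paper:

```python
import numpy as np

def depth_to_point_cloud(depth, fx, fy, cx, cy):
    """Back-project a depth image (in metres) into camera-frame 3D points.

    Standard pinhole model: X = (u - cx) * Z / fx, Y = (v - cy) * Z / fy.
    Pixels with zero depth (no measurement) are discarded.
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    valid = depth > 0
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return np.stack([x[valid], y[valid], depth[valid]], axis=-1)  # (N, 3)

# Toy example: a flat 4x4 depth map, every pixel 1 m from the camera
depth = np.ones((4, 4))
pts = depth_to_point_cloud(depth, fx=2.0, fy=2.0, cx=2.0, cy=2.0)
print(pts.shape)  # (16, 3)
```

The synthesized opposite-view depth image would be back-projected the same way, transformed into the first camera's frame, and then merged with these points via ICP alignment.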
Highlights
In nutritional epidemiology, detailed food information is required to help dietitians evaluate the eating behavior of participants
Following previous work on point cloud completion [20,31,32], the holdout method, a simple form of cross-validation, was used to evaluate the model: 70% of the rendered depth images (20 k per object item) were used to train the neural network, while 10% (2.85 k per object item) and 20% (5.71 k per object item) of the images with unseen viewing angles were selected as the validation and testing datasets, respectively
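The 70/10/20 holdout split described in this highlight can be sketched as follows. The split proportions come from the text; the shuffling, random seed, and toy dataset size of 10 000 images are assumptions for illustration:

```python
import numpy as np

def holdout_split(n_images, train=0.7, val=0.1, seed=0):
    """Shuffle image indices and split them 70/10/20 into
    train/validation/test sets (holdout method)."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(n_images)
    n_train = int(n_images * train)
    n_val = int(n_images * val)
    return idx[:n_train], idx[n_train:n_train + n_val], idx[n_train + n_val:]

# Toy example with 10 000 rendered depth images
tr, va, te = holdout_split(10_000)
print(len(tr), len(va), len(te))  # 7000 1000 2000
```

In the paper's setting the split is applied per object item, so each item contributes rendered views to all three sets while the test views come from unseen viewing angles.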
An integrated approach based on depth sensing and deep learning view synthesis is proposed to enable accurate food volume estimation from a single depth image taken at any convenient angle
Summary
In nutritional epidemiology, detailed food information is required to help dietitians evaluate the eating behavior of participants. 24-hour dietary recall (24HR), a dietary assessment method, is commonly used to capture information on the food eaten by participants. A complete dietary assessment procedure can be divided into four main parts: food identification, portion size estimation, nutrient intake calculation, and dietary analysis. When using 24HR, the volume or portion size of food items relies heavily on participants' subjective judgement, which undoubtedly leads to inaccurate and biased dietary assessment results. It is therefore essential to develop objective dietary assessment techniques to address the problems of inaccurate and subjective measurement. A variety of computer vision-based techniques have been proposed to tackle the problem of quantifying food portions. Food volume measurement techniques can be divided into two main categories: model-based and stereo-based. Sun et al. [3] proposed a virtual reality (VR) approach which utilizes pre-built 3D food models with known volumes for users
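Once a completed point cloud is available, a common way to turn it into a portion-size estimate is voxelization: quantize the points onto a grid and count occupied cells. The sketch below is one such estimator, assuming a hypothetical voxel size; the paper does not specify its exact volume computation, so this is illustrative rather than the authors' method:

```python
import numpy as np

def voxel_volume(points, voxel_size=0.005):
    """Estimate object volume by counting occupied voxels.

    Each 3D point (metres) is quantized to a voxel index; the number of
    unique occupied voxels times the volume of one voxel approximates
    the object volume. Accuracy depends on the chosen voxel size.
    """
    voxels = np.unique(np.floor(points / voxel_size).astype(int), axis=0)
    return len(voxels) * voxel_size ** 3

# Toy check: points at the centres of a 10 cm cube on a 1 cm grid
g = np.arange(0, 0.10, 0.01) + 0.005
xx, yy, zz = np.meshgrid(g, g, g)
pts = np.stack([xx.ravel(), yy.ravel(), zz.ravel()], axis=-1)
print(round(voxel_volume(pts, voxel_size=0.01), 6))  # 0.001 m^3, i.e. 1 litre
```

Smaller voxels capture shape detail better but demand a denser point cloud, which is exactly why completing the occluded side of the food item via view synthesis matters for volume accuracy.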