Abstract

Advancements in computer vision have resulted in significant breakthroughs across various applications, and one notable area of progress is in the recognition of food ingredient types and states. The identification of food items, distinguishing between types like oranges or apples, and assessing their states, whether whole, peeled, sliced, or juiced, is a pivotal task with far-reaching implications for fields such as food safety, recipe analysis, and restaurant quality control. This paper introduces an innovative approach to food type and state recognition that capitalizes on attention mechanisms and incorporates mask fusion to improve the accuracy and robustness of the recognition process. We evaluate the proposed approach through quantitative and qualitative analyses and comparisons to previous methods. The results consistently demonstrate that our proposed approach, integrating attention mechanisms, outperforms baseline and state-of-the-art methods, achieving an accuracy of 87.11%. This achievement signifies a step forward in refining food image segmentation models and reinforces the applicability of advanced techniques in real-world scenarios.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.