Recently, visual food analysis has received increasing attention in the computer vision community due to its wide range of applications, e.g., dietary nutrition management, smart restaurants, and personalized diet recommendation. Since food images are unstructured, with complex and variable visual patterns, mining food-related semantic-aware regions is crucial. Furthermore, the ingredients in food images are semantically related to each other through cooking habits, and bear significant semantic relationships to food categories under the hierarchical food classification ontology. Therefore, modeling the long-range semantic relationships among ingredients and the category-ingredient semantic interactions is beneficial for ingredient recognition and food analysis. Taking these factors into consideration, we propose a multi-task learning framework for food category and ingredient recognition. The framework consists of a food-oriented Transformer named Convolution-Enhanced Bi-Branch Adaptive Transformer (CBiAFormer) and a multi-task category-ingredient recognition network called Structural Learning and Cross-Task Interaction (SLCI). To capture the complex and variable fine-grained patterns of food images, CBiAFormer employs a query-aware, data-adaptive attention mechanism called Bi-Branch Adaptive Attention (BiA-Attention), which combines a local fine-grained branch and a global coarse-grained branch to mine local and global semantic-aware regions by adaptively assigning a candidate key/value set to each query. Additionally, we propose a convolutional patch embedding module to extract the fine-grained features that are often neglected by Transformers. To fully utilize the ingredient information, SLCI comprises cross-layer attention, which models the semantic relationships among ingredients, and two cross-task interaction modules, which mine the semantic interactions between categories and ingredients. Extensive experiments show that our method achieves competitive performance on three mainstream food datasets (ETH Food-101, Vireo Food-172, and ISIA Food-200), and visualization analyses of CBiAFormer and SLCI on both tasks demonstrate its effectiveness. Code and models are available at https://github.com/Liuyuxinict/CBiAFormer.
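To make the BiA-Attention idea concrete, below is a minimal PyTorch sketch of a bi-branch adaptive attention layer: each query attends to full-resolution keys/values inside a local window (fine-grained branch) and to spatially pooled keys/values (coarse-grained branch), and a learned per-query gate adaptively mixes the two. All names (BiBranchAdaptiveAttention, window, pool_size, the sigmoid gate) and the soft-gating formulation are illustrative assumptions, not the paper's implementation, which assigns candidate key/value sets to each query.

```python
import torch
import torch.nn as nn


class BiBranchAdaptiveAttention(nn.Module):
    """Sketch: a local windowed branch plus a global pooled branch,
    mixed per query by a learned sigmoid gate (hypothetical design)."""

    def __init__(self, dim, num_heads=4, window=7, pool_size=7):
        super().__init__()
        assert dim % num_heads == 0
        self.h, self.d = num_heads, dim // num_heads
        self.scale = self.d ** -0.5
        self.window = window
        self.pool = nn.AdaptiveAvgPool2d(pool_size)
        self.q = nn.Linear(dim, dim)
        self.kv = nn.Linear(dim, dim * 2)
        self.gate = nn.Linear(dim, 1)  # per-query branch-selection score
        self.proj = nn.Linear(dim, dim)

    def _attend(self, q, k, v):
        # q: (..., Nq, d); k, v: (..., Nk, d)
        attn = (q @ k.transpose(-2, -1)) * self.scale
        return attn.softmax(dim=-1) @ v

    def forward(self, x, H, W):
        # x: (B, N, C) with N == H * W; H, W divisible by the window size
        B, N, C = x.shape
        w = self.window

        # Local fine-grained branch: attention inside w x w windows.
        xl = x.view(B, H // w, w, W // w, w, C).permute(0, 1, 3, 2, 4, 5)
        xl = xl.reshape(-1, w * w, C)  # (B * num_windows, w*w, C)
        ql = self.q(xl).view(-1, w * w, self.h, self.d).transpose(1, 2)
        kl, vl = (self.kv(xl).view(-1, w * w, 2, self.h, self.d)
                  .permute(2, 0, 3, 1, 4))
        loc = self._attend(ql, kl, vl).transpose(1, 2).reshape(-1, w * w, C)
        loc = (loc.view(B, H // w, W // w, w, w, C)
               .permute(0, 1, 3, 2, 4, 5).reshape(B, N, C))

        # Global coarse-grained branch: keys/values from a pooled feature map.
        qg = self.q(x).view(B, N, self.h, self.d).transpose(1, 2)
        pooled = (self.pool(x.transpose(1, 2).reshape(B, C, H, W))
                  .flatten(2).transpose(1, 2))  # (B, pool_size**2, C)
        kg, vg = (self.kv(pooled).view(B, -1, 2, self.h, self.d)
                  .permute(2, 0, 3, 1, 4))
        glb = self._attend(qg, kg, vg).transpose(1, 2).reshape(B, N, C)

        # Adaptive per-query mixing: a soft stand-in for assigning each
        # query its own candidate key/value set.
        g = torch.sigmoid(self.gate(x))  # (B, N, 1)
        return self.proj(g * loc + (1 - g) * glb)


attn = BiBranchAdaptiveAttention(dim=96, num_heads=4, window=7, pool_size=7)
x = torch.randn(2, 56 * 56, 96)  # a 56 x 56 token grid
out = attn(x, H=56, W=56)        # -> (2, 3136, 96)
```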
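Similarly, a minimal sketch of one possible category-ingredient cross-task interaction: each task's logits are refined by the other task's prediction through a learned cross-task mapping, a rough proxy for the category-ingredient semantics under the food ontology. The module name, the two linear mappings, and the residual fusion are hypothetical, not the paper's SLCI design.

```python
import torch
import torch.nn as nn


class CrossTaskInteraction(nn.Module):
    """Sketch: category and ingredient heads refine each other's logits
    through learned cross-task mappings (hypothetical design)."""

    def __init__(self, dim, num_categories, num_ingredients):
        super().__init__()
        self.cat_head = nn.Linear(dim, num_categories)
        self.ing_head = nn.Linear(dim, num_ingredients)
        # learned cross-task priors (a proxy for the food ontology)
        self.ing2cat = nn.Linear(num_ingredients, num_categories, bias=False)
        self.cat2ing = nn.Linear(num_categories, num_ingredients, bias=False)

    def forward(self, feat):
        # feat: (B, dim) pooled backbone feature
        cat_logits = self.cat_head(feat)  # single-label category task
        ing_logits = self.ing_head(feat)  # multi-label ingredient task
        # residual refinement: each task conditions on the other's prediction
        cat_out = cat_logits + self.ing2cat(torch.sigmoid(ing_logits))
        ing_out = ing_logits + self.cat2ing(cat_logits.softmax(dim=-1))
        return cat_out, ing_out
```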