Human-object Interaction Research Articles

Object detection and action/gesture recognition have become essential in security and surveillance, with extensive applications in everyday life. Advances in these technologies will further cybersecurity and extended reality systems through the accurate identification of users and their interactions, which plays a pivotal role in managing the security of an entity and in providing an immersive experience. Essentially, they enable the identification of human–object interactions so that actions and behaviors can be tracked alongside user identification. Yet traditional camera-based methods struggle with this task, because occlusion, differing camera viewpoints, and background noise cause significant appearance variation. Deep learning techniques also demand large labeled datasets and substantial computational power. In this paper, a novel approach to recognizing human–object interactions and identifying the interacting users is proposed, based on three-dimensional hand pose data from an egocentric camera view. The approach is multistage: it integrates object detection with interaction recognition and user identification, using data from hand joints and vertices. It uses a statistical attribute-based model for feature extraction and representation. The proposed technique is tested on the HOI4D dataset using the XGBoost classifier, achieving an average F1-score of 81% for human–object interaction and an average F1-score of 80% for user identification, which indicates that it is effective. The technique is primarily targeted at extended reality systems, where reliable interaction recognition and user identification are key to keeping systems secure and personalized. Its relevance extends to cybersecurity, augmented reality, virtual reality, and human–robot interaction, offering a way to strengthen security while also enhancing interactivity in such systems.
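
As a rough illustration of what statistical feature extraction from hand-joint data could look like, the sketch below summarizes a short clip of 3D joint positions as per-axis statistics. The abstract does not list the statistics the authors compute, so the choice of mean, standard deviation, minimum, and maximum, the function name `statistical_features`, and the sample clip are all assumptions made for this example. A fixed-length vector like this is the kind of input a tabular classifier such as XGBoost accepts.

```python
from statistics import mean, pstdev


def statistical_features(frames):
    """Summarize a clip of 3D hand joints as per-axis statistics.

    frames: list of frames; each frame is a list of (x, y, z) joint positions.
    Returns 12 floats: mean, std, min, max for x, then for y, then for z.
    The set of statistics is an assumption, not the paper's feature list.
    """
    features = []
    for axis in range(3):
        # Pool this coordinate over every joint in every frame.
        values = [joint[axis] for frame in frames for joint in frame]
        features.extend([mean(values), pstdev(values), min(values), max(values)])
    return features


# Hypothetical clip: two frames with two joints each.
clip = [
    [(0.0, 1.0, 2.0), (2.0, 1.0, 4.0)],
    [(4.0, 1.0, 6.0), (6.0, 1.0, 8.0)],
]
vector = statistical_features(clip)
```

Pooling over frames keeps the vector length independent of clip duration, which is one plausible reason to prefer summary statistics over raw joint sequences for a tree-based classifier.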

This paper focuses on Human–Object Interaction (HOI) detection, addressing the challenge of identifying and understanding the interactions between humans and objects within a given image or video frame. Spearheaded by Detection Transformer (DETR), recent developments have led to significant improvements by replacing traditional region proposals with a set of learnable queries. However, despite the powerful representation capabilities provided by Transformers, existing HOI detection methods still yield low confidence levels when dealing with complex interactions and are prone to overlooking interactive actions. To address these issues, we propose a novel approach, UAHOI (Uncertainty-aware Robust Human–Object Interaction Learning), that explicitly estimates prediction uncertainty during training to refine both detection and interaction predictions. Our model not only predicts the HOI triplets but also quantifies the uncertainty of these predictions. Specifically, we model this uncertainty through the variance of predictions and incorporate it into the optimization objective, allowing the model to adaptively adjust its confidence threshold based on prediction variance. This integration serves as an automatic confidence threshold: it mitigates the adverse effects of the incorrect or ambiguous predictions that are common in traditional methods, without any hand-designed components. Our method can be applied to existing HOI detection methods and demonstrates improved accuracy. We evaluate UAHOI on two standard benchmarks in the field, V-COCO and HICO-DET, which represent challenging scenarios for HOI detection. Through extensive experiments, we demonstrate that UAHOI achieves significant improvements over existing state-of-the-art methods, enhancing both the accuracy and robustness of HOI detection.
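
To make the idea of folding prediction variance into a training objective more concrete, the sketch below shows one widely used form, a variance-attenuated loss. It is not claimed to be UAHOI's objective, which the abstract does not spell out; the function name `uncertainty_weighted_loss` and the choice to parameterize uncertainty as a log-variance are assumptions for illustration.

```python
import math


def uncertainty_weighted_loss(base_loss, log_variance):
    """Attenuate a per-prediction loss by its predicted variance.

    Computes exp(-s) * base_loss + s, where s = log(sigma^2).
    A large variance down-weights the base loss, while the + s term
    penalizes declaring every prediction uncertain.
    This is an illustrative form, not the objective from the paper.
    """
    return math.exp(-log_variance) * base_loss + log_variance


# A hard example (large base loss) is cheaper when flagged as uncertain ...
hard_certain = uncertainty_weighted_loss(2.0, 0.0)
hard_uncertain = uncertainty_weighted_loss(2.0, 1.0)

# ... while an easy example (small base loss) is cheaper when kept confident.
easy_certain = uncertainty_weighted_loss(0.1, 0.0)
easy_uncertain = uncertainty_weighted_loss(0.1, 1.0)
```

Under this form the loss is minimized when the log-variance equals the logarithm of the base loss, so the predicted variance tracks how hard each example is, which is one way a model could arrive at an adaptive confidence level without a hand-set threshold.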

Articles published on Human-object Interaction

HOIMotion: Forecasting Human Motion During Human-Object Interactions Using Egocentric 3D Object Bounding Boxes.

Human–Object Interaction Detection: An Overview

Auxiliary Feature Fusion and Noise Suppression for HOI Detection

Using 3D Hand Pose Data in Recognizing Human–Object Interaction and User Identification for Extended Reality Systems

CHA: Conditional Hyper-Adapter method for detecting human–object interaction

PPDM++: Parallel Point Detection and Matching for Fast and Accurate HOI Detection.

HOI-V: One-stage human-object interaction detection based on multi-feature fusion in videos

Adaptive multimodal prompt for human-object interaction with local feature enhanced transformer

Learning 3D human–object interaction graphs from transferable context knowledge for construction monitoring

Deep scene understanding with extended text description for human object interaction detection

Human–object interaction detection algorithm based on graph structure and improved cascade pyramid network

Exploring the synergy between textual identity and visual signals in human-object interaction

Semantic-Aware Dynamic Generation Networks for Few-Shot Human-Object Interaction Recognition.

Dynamic human–object interaction detection for feature exclusion in visual simultaneous localization and mapping (SLAM)

Human–object interaction detection via recycling of ground-truth annotations

LiDARCapV2: 3D human pose estimation with human–object interaction from LiDAR point clouds

Human object interaction detection in paintings using multi-task learning

UAHOI: Uncertainty-aware robust interaction learning for HOI detection

A Novel Part Refinement Tandem Transformer for Human-Object Interaction Detection.

Human–Object Interaction: Development of a Usability Index for Product Design Using a Hierarchical Fuzzy Axiomatic Design
