Object Bounding Box Research Articles

In still image human action recognition, existing studies have mainly leveraged extra bounding box information along with class labels to mitigate the lack of temporal information in still images. However, preparing additional annotations such as human and object bounding boxes is time-consuming and also prone to human errors because these annotations are prepared manually. In this paper, we propose a two-phase multi-expert classification method for human action recognition by means of super-class learning and without any extra information. Specifically, a coarse-grained phase selects the most relevant fine-grained experts. Then, the fine-grained experts encode the intricate details within each super-class so that the inter-class variation increases. In the proposed approach, to choose the best configuration for each super-class and characterize inter-class dependency between different action classes, we propose a novel Graph-Based Class Selection (GCS) algorithm. Moreover, the proposed method copes with long-tailed distribution, which the existing studies have not addressed in action recognition. Extensive experimental evaluations are conducted on various public human action recognition datasets, including Stanford40, Pascal VOC 2012 Action, BU101+, and IHAR datasets. The experimental results demonstrate that the proposed method yields promising improvements. To be more specific, in IHAR, Sanford40, Pascal VOC 2012 Action, and BU101+ benchmarks, the proposed approach outperforms the state-of-the-art studies by 8.92%, 0.41%, 0.66%, and 2.11% with much less computational cost and without any auxiliary annotation information. Besides, it is proven that in addressing action recognition with long-tailed distribution, the proposed method outperforms its counterparts by a significant margin.

Read full abstract

In the face of Synthetic Aperture Radar (SAR) image object detection with different distributions of training and test data, traditional supervised learning methods cannot achieve good detection performance. Domain adaptation (DA) method has been shown to have the ability to solve this problem, but existing DA object detection algorithms all use adversarial DA theory for the detection task, which is ineffective in solving object regression localization in the detection task. In this article, to better solve the above problem, an automatic SAR image ship detection method based on feature decomposition crossing different satellites is proposed. The feature extraction layer of backbone network is divided into low level and high level, where domain-invariant feature extractors are designed for the local features extracted from the low level and the global features extracted from the high level, respectively. We argue that the local and global features extracted from source domain and target domain contain domain-specific features (DSF) for adversarial DA and domain-invariant features (DIF) that contribute to object regression localization. Then, we decompose the local features and global features into DSF and DIF via vector decomposition method. For DSF counterpart, we introduce adversarial DA attention for feature alignment. DIF from the local features are fused into the backbone network for high-level global feature extraction. Finally, by using region proposal network and adversarial domain classifier, we can get the accurate bounding box and object class of SAR image objects. Extensive experiments prove that the proposed method outperforms state-of-the-art methods in terms of detection performance.

Read full abstract

Object Bounding Box Research Articles

Related Topics

Articles published on Object Bounding Box

Fast and Robust Visual Tracking with Few-Iteration Meta-Learning

Pillar-Based Cooperative Perception from Point Clouds for 6G-Enabled Cooperative Autonomous Vehicles

Adaptive Fusion CNN Features for RGBT Object Tracking

DEEP LEARNING-BASED TRACKING OF MULTIPLE OBJECTS IN THE CONTEXT OF FARM ANIMAL ETHOLOGY

Multi-expert human action recognition with hierarchical super-class learning

2차원 영상에서의 깊이 추정을 위한 객체 인식 제어 방법

Adaptive depth-aware visual relationship detection

Autonomous Recognition of Multiple Surgical Instruments Tips Based on Arrow OBB-YOLO Network

A Feature Decomposition-Based Method for Automatic Ship Detection Crossing Different Satellite SAR Images

Object-Agnostic Transformers for Video Referring Segmentation.

Multivehicle Object Tracking in Satellite Video Enhanced by Slow Features and Motion Features

Robotic Objects Detection and Grasping in Clutter Based on Cascaded Deep Convolutional Neural Network

Object-wise Secure Image Display Method for Screen Capture Protection

AHDet: A dynamic coarse-to-fine gaze strategy for active object detection

Occlusion-robust online multi-object visual tracking using a GM-PHD filter with CNN-based re-identification

АЛГОРИТМ КОМПЛЕКСИРОВАНИЯ НЕСКОЛЬКИХ ИСТОЧНИКОВ ДАННЫХ В ОБЩУЮ КАРТУ ЗАНЯТОСТИ

MOHE-NET: MONOCULAR OBJECT HEIGHT ESTIMATION NETWORK USING DEEP LEARNING AND SCENE GEOMETRY

Visual Relation Detection using Hybrid Analogical Learning

Co-Training for Deep Object Detection: Comparing Single-Modal and Multi-Modal Approaches.

Augmented Reality Appling with Consistency of Behavior using Oriented Bounding Box Algorithm

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Object Bounding Box Research Articles

Related Topics

Articles published on Object Bounding Box

Fast and Robust Visual Tracking with Few-Iteration Meta-Learning

Pillar-Based Cooperative Perception from Point Clouds for 6G-Enabled Cooperative Autonomous Vehicles

Adaptive Fusion CNN Features for RGBT Object Tracking

DEEP LEARNING-BASED TRACKING OF MULTIPLE OBJECTS IN THE CONTEXT OF FARM ANIMAL ETHOLOGY

Multi-expert human action recognition with hierarchical super-class learning

2차원 영상에서의 깊이 추정을 위한 객체 인식 제어 방법

Adaptive depth-aware visual relationship detection

Autonomous Recognition of Multiple Surgical Instruments Tips Based on Arrow OBB-YOLO Network

A Feature Decomposition-Based Method for Automatic Ship Detection Crossing Different Satellite SAR Images

Object-Agnostic Transformers for Video Referring Segmentation.

Multivehicle Object Tracking in Satellite Video Enhanced by Slow Features and Motion Features

Robotic Objects Detection and Grasping in Clutter Based on Cascaded Deep Convolutional Neural Network

Object-wise Secure Image Display Method for Screen Capture Protection

AHDet: A dynamic coarse-to-fine gaze strategy for active object detection

Occlusion-robust online multi-object visual tracking using a GM-PHD filter with CNN-based re-identification

АЛГОРИТМ КОМПЛЕКСИРОВАНИЯ НЕСКОЛЬКИХ ИСТОЧНИКОВ ДАННЫХ В ОБЩУЮ КАРТУ ЗАНЯТОСТИ

MOHE-NET: MONOCULAR OBJECT HEIGHT ESTIMATION NETWORK USING DEEP LEARNING AND SCENE GEOMETRY

Visual Relation Detection using Hybrid Analogical Learning

Co-Training for Deep Object Detection: Comparing Single-Modal and Multi-Modal Approaches.

Augmented Reality Appling with Consistency of Behavior using Oriented Bounding Box Algorithm