Achievable Detection Performance Research Articles

Abstract The availability of data is limited in some fields, especially for object detection tasks, where it is necessary to have correctly labeled bounding boxes around each object. A notable example of such data scarcity is found in the domain of marine biology, where it is useful to develop methods to automatically detect submarine species for environmental monitoring. To address this data limitation, the state-of-the-art machine learning strategies employ two main approaches. The first involves pretraining models on existing datasets before generalizing to the specific domain of interest. The second strategy is to create synthetic datasets specifically tailored to the target domain using methods like copy-paste techniques or ad-hoc simulators. The first strategy often faces a significant domain shift, while the second demands custom solutions crafted for the specific task. In response to these challenges, here we propose a transfer learning framework that is valid for a generic scenario. In this framework, generated images help to improve the performances of an object detector in a few-real data regime. This is achieved through a diffusion-based generative model that was pretrained on large generic datasets. With respect to the state-of-the-art, we find that it is not necessary to fine tune the generative model on the specific domain of interest. We believe that this is an important advance because it mitigates the labor-intensive task of manual labeling the images in object detection tasks. We validate our approach focusing on fishes in an underwater environment, and on the more common domain of cars in an urban setting. Our method achieves detection performance comparable to models trained on thousands of images, using only a few hundreds of input data. Our results pave the way for new generative AI-based protocols for machine learning applications in various domains, for instance ranging from geophysics to biology and medicine.

Read full abstract

Conventional open-world object detection (OWOD) problem setting first distinguishes known and unknown classes and then later incrementally learns the unknown objects when introduced with labels in the subsequent tasks. However, the current OWOD formulation heavily relies on the external human oracle for knowledge input during the incremental learning stages. Such reliance on run-time makes this formulation less realistic in a real-world deployment. To address this, we introduce a more realistic formulation, named semi-supervised open-world detection (SS-OWOD), that reduces the annotation cost by casting the incremental learning stages of OWOD in a semi-supervised manner. We demonstrate that the performance of the state-of-the-art OWOD detector dramatically deteriorates in the proposed SS-OWOD setting. Therefore, we introduce a novel SS-OWOD detector, named SS-OWFormer, that utilizes a feature-alignment scheme to better align the object query representations between the original and augmented images to leverage the large unlabeled and few labeled data. We further introduce a pseudo-labeling scheme for unknown detection that exploits the inherent capability of decoder object queries to capture object-specific information. On the COCO dataset, our SS-OWFormer using only 50% of the labeled data achieves detection performance that is on par with the state-of-the-art (SOTA) OWOD detector using all the 100% of labeled data. Further, our SS-OWFormer achieves an absolute gain of 4.8% in unknown recall over the SOTA OWOD detector. Lastly, we demonstrate the effectiveness of our SS-OWOD problem setting and approach for remote sensing object detection, proposing carefully curated splits and baseline performance evaluations. Our experiments on 4 datasets including MS COCO, PASCAL, Objects365 and DOTA demonstrate the effectiveness of our approach. Our source code, models and splits are available here https://github.com/sahalshajim/SS-OWFormer

Read full abstract

Achievable Detection Performance Research Articles

Related Topics

Articles published on Achievable Detection Performance

Transfer learning with generative models for object detection on limited datasets

YOLIC: An efficient method for object localization and classification on edge devices

Semi-supervised Open-World Object Detection

SSPT-bpMRI: A Self-supervised Pre-training Scheme for Improving Prostate Cancer Detection and Diagnosis in Bi-parametric MRI.

High mobility enabled spatial and media‐based modulated orthogonal frequency division multiplexing systems for beyond 5G wireless communications

Scale-adaptive local intentional surface feature detection

Spatial Diversity in Radar Detection via Active Reconfigurable Intelligent Surfaces

Remote interference management in 5G new radio: methods and performance

Soft Output Signal Detection for Massive MIMO Systems Based on Chebyshev Trace Iteration

Exploring a Multimodal Mixture-Of-YOLOs Framework for Advanced Real-Time Object Detection

Adaptive Bayesian Detection for MIMO Radar in Gaussian Clutter

A low-complexity photoplethysmographic systolic peak detector for compressed sensed data

Detection of Sparse Stochastic Signals With Quantized Measurements in Sensor Networks

Bayesian Detection for MIMO Radar in Gaussian Clutter

Temporal Action Detection in Untrimmed Videos from Fine to Coarse Granularity

Secrecy Constrained Distributed Detection in Sensor Networks

Detection Performance of a Forward Scatter Radar Using a Crystal Video Detector

Hidden Markov Model Based Signal Characterization for Weak Light Communication

Massive MIMO for Distributed Detection With Transceiver Impairments

Random-vibration-based damage detection and precise localization on a lab–scale aircraft stabilizer structure via the Generalized Functional Model Based Method

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Achievable Detection Performance Research Articles

Related Topics

Articles published on Achievable Detection Performance

Transfer learning with generative models for object detection on limited datasets

YOLIC: An efficient method for object localization and classification on edge devices

Semi-supervised Open-World Object Detection

SSPT-bpMRI: A Self-supervised Pre-training Scheme for Improving Prostate Cancer Detection and Diagnosis in Bi-parametric MRI.

High mobility enabled spatial and media‐based modulated orthogonal frequency division multiplexing systems for beyond 5G wireless communications

Scale-adaptive local intentional surface feature detection

Spatial Diversity in Radar Detection via Active Reconfigurable Intelligent Surfaces

Remote interference management in 5G new radio: methods and performance

Soft Output Signal Detection for Massive MIMO Systems Based on Chebyshev Trace Iteration

Exploring a Multimodal Mixture-Of-YOLOs Framework for Advanced Real-Time Object Detection

Adaptive Bayesian Detection for MIMO Radar in Gaussian Clutter

A low-complexity photoplethysmographic systolic peak detector for compressed sensed data

Detection of Sparse Stochastic Signals With Quantized Measurements in Sensor Networks

Bayesian Detection for MIMO Radar in Gaussian Clutter

Temporal Action Detection in Untrimmed Videos from Fine to Coarse Granularity

Secrecy Constrained Distributed Detection in Sensor Networks

Detection Performance of a Forward Scatter Radar Using a Crystal Video Detector

Hidden Markov Model Based Signal Characterization for Weak Light Communication

Massive MIMO for Distributed Detection With Transceiver Impairments

Random-vibration-based damage detection and precise localization on a lab–scale aircraft stabilizer structure via the Generalized Functional Model Based Method