MS COCO Research Articles

Currently, real-time detection networks elaborate the technical details of the Faster Regional Convolution Neural Network (R-CNN) recognition pipeline. Within existing R-CNN literature, the evolution exhibited by R-CNN is most profound in terms of computational efficiency integrating each training stage to reduce test time and improvement in mean average precision (mAP), which can be infused into an artificially intelligent (AI), machine learning (ML), real-time, interactive, recovery capital application (app). This article introduces a Region Proposal Network (RPN) that shares full-image convolutional features with a real-time detection AI-ML infused network in an interactive, continuously self-learning wrist-wearable real-time recovery capital app for enabling cost-free region proposals (e.g., instantaneous body physiological responses, mapped connections to emergency services, sponsor, counselor, peer support, links to local and specific recovery capital assets, etc.). A fully merged RPN and Faster R-CNN deep convolutional unified network in the app can simultaneously train to aggregate and predict object bounds and objectness scores for implementing recovery capital real-time solutions (e.g., baseball card scoring dashboards, token-based incentive programs, etc.) A continuous training scheme alternates between fine-tuning RPN tasks (e.g., logging and updating personal client information, gamification orientation) and fine-tuning the detection (e.g., real-time biometric monitoring client’s behavior for self-awareness of when to connect with an addiction specialist or family member, quick response (QR) code registration for a 12-step program, advanced security encryption, etc.) in the interactive app. The very deep VGG-16 model detection system has a frame rate of 5fps within a graphic processing unit (GPU) while accomplishing sophisticated object detection accuracy on PASCAL Visual Object Classification Challenge (PASCAL VOC) and Microsoft Common Objects in Context (MS COCO) datasets. This is achieved with only 300 proposals per real-time retrieved data capture point, information bit or image. The app has real-time, infused cartographic and statistical tracking tools to generate Python Codes, which can enable a gamified addiction recovery-oriented digital conscience. Faster R-CNN and RPN can be the foundations of an interactive real-time recovery capital app that can be adaptable to multiple recovery pathways based on participant recovery plans and actions. This paper discusses some of the critical attributes and features to include in the design of a future app to support and close current gaps in needed recovery capital to help those who are dealing with many different forms of addiction recovery.

Read full abstract

The recent advances in compressing high-accuracy convolutional neural networks (CNNs) have witnessed remarkable progress in real-time object detection. To accelerate detection speed, lightweight detectors always have few convolution layers using a single-path backbone. Single-path architecture, however, involves continuous pooling and downsampling operations, always resulting in coarse and inaccurate feature maps that are disadvantageous to locate objects. On the other hand, due to limited network capacity, recent lightweight networks are often weak in representing large-scale visual data. To address these problems, we present a dual-path network, named DPNet, with a lightweight attention scheme for real-time object detection. The dual-path architecture enables us to extract in parallel high-level semantic features and low-level object details. Although DPNet has a nearly duplicated shape with respect to single-path detectors, the computational costs and model size are not significantly increased. To enhance representation capability, a lightweight self-correlation module (LSCM) is designed to capture global interactions, with only a few computational overheads and network parameters. In the neck, LSCM is extended into a lightweight cross correlation module (LCCM), capturing mutual dependencies among neighboring scale features. We have conducted exhaustive experiments on MS COCO, Pascal VOC 2007, and ImageNet datasets. The experimental results demonstrate that DPNet achieves a state-of-the-art trade off between detection accuracy and implementation efficiency. More specifically, DPNet achieves 31.3% AP on MS COCO test-dev, 82.7% mAP on Pascal VOC 2007 test set, and 41.6% mAP on ImageNet validation set, together with nearly 2.5M model size, 1.04 GFLOPs, and 164 and 196 frames/s (FPS) FPS for [Formula: see text] input images of three datasets.

Read full abstract

MS COCO Research Articles

Articles published on MS COCO

Auxiliary Tasks Enhanced Dual-Affinity Learning for Weakly Supervised Semantic Segmentation.

Closing the Gap on Addiction Recovery Engagement with an AI-infused Convolutional Neural Network Technology Application—A Design Vision

Anti-Adversarially Manipulated Attributions for Weakly Supervised Semantic Segmentation and Object Localization.

Improving the YOLOV7 Algorithm for Object Detection within Recorded Videos

Training Object Detectors from Scratch: An Empirical Study in the Era of Vision Transformer

Visual representation learning using graph-based higher-order heuristic distillation for cell detection in blood smear images

WildARe-YOLO: A lightweight and efficient wild animal recognition model

Composite Backbone Small Object Detection Based on Context and Multi-Scale Information with Attention Mechanism

RVAIC: Refined visual attention for improved image captioning

AdaDet: An Adaptive Object Detection System Based on Early-Exit Neural Networks

Channel-level Matching Knowledge Distillation for object detectors via MSE

Disentangling the intrinsic feature from the related feature in image classification using knowledge distillation and object replacement

Research on dense object detection methods in congested environments of urban streets and roads based on DCYOLO

SDSDet: A real-time object detector for small, dense, multi-scale remote sensing objects

Semi-Supervised Object Detection with Multi-Scale Regularization and Bounding Box Re-Prediction

Spiking Neural Network for Ultralow-Latency and High-Accurate Object Detection.

Joint-Guided Distillation Binary Neural Network via Dynamic Channel-Wise Diversity Enhancement for Object Detection

Negative Deterministic Information-Based Multiple Instance Learning for Weakly Supervised Object Detection and Segmentation.

DPNet: Dual-Path Network for Real-Time Object Detection With Lightweight Attention.

AI-based outdoor moving object detection for smart city surveillance

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

MS COCO Research Articles

Articles published on MS COCO

Auxiliary Tasks Enhanced Dual-Affinity Learning for Weakly Supervised Semantic Segmentation.

Closing the Gap on Addiction Recovery Engagement with an AI-infused Convolutional Neural Network Technology Application—A Design Vision

Anti-Adversarially Manipulated Attributions for Weakly Supervised Semantic Segmentation and Object Localization.

Improving the YOLOV7 Algorithm for Object Detection within Recorded Videos

Training Object Detectors from Scratch: An Empirical Study in the Era of Vision Transformer

Visual representation learning using graph-based higher-order heuristic distillation for cell detection in blood smear images

WildARe-YOLO: A lightweight and efficient wild animal recognition model

Composite Backbone Small Object Detection Based on Context and Multi-Scale Information with Attention Mechanism

RVAIC: Refined visual attention for improved image captioning

AdaDet: An Adaptive Object Detection System Based on Early-Exit Neural Networks

Channel-level Matching Knowledge Distillation for object detectors via MSE

Disentangling the intrinsic feature from the related feature in image classification using knowledge distillation and object replacement

Research on dense object detection methods in congested environments of urban streets and roads based on DCYOLO

SDSDet: A real-time object detector for small, dense, multi-scale remote sensing objects

Semi-Supervised Object Detection with Multi-Scale Regularization and Bounding Box Re-Prediction

Spiking Neural Network for Ultralow-Latency and High-Accurate Object Detection.

Joint-Guided Distillation Binary Neural Network via Dynamic Channel-Wise Diversity Enhancement for Object Detection

Negative Deterministic Information-Based Multiple Instance Learning for Weakly Supervised Object Detection and Segmentation.

DPNet: Dual-Path Network for Real-Time Object Detection With Lightweight Attention.

AI-based outdoor moving object detection for smart city surveillance