Reasonable object detection guided by knowledge of global context and category relationship

Haoqin Ji, Kai Ye, Qi Wan, Linlin Shen

doi:10.1016/j.eswa.2022.118285

Abstract

Mainstream object detectors usually treat each region separately, overlooking important global context information and the associations between object categories. Existing methods model global context via attention mechanisms, which require ad hoc design and prior knowledge. Some works combine CNN features with label dependencies learned from a pre-defined graph and word embeddings; these ignore the gap between visual features and the textual corpus and are usually task-specific (they depend on RoIPool/RoIAlign). To remove these specific settings and enable different types of detectors to refine their detection results with the help of prior knowledge, we propose KROD (Knowledge-guided Reasonable Object Detection), which consists of the GKM (Global Category Knowledge Mining) module and the CRM (Category Relationship Knowledge Mining) module and improves detection performance by mimicking the process of human reasoning. For a given image, GKM introduces global category knowledge into the detector by simply attaching a multi-label image classification branch to the backbone. Meanwhile, CRM feeds the raw detection outputs into a knowledge graph based on object category co-occurrence and further refines the original results with the help of a GCN (Graph Convolutional Network). We also propose a novel loss-aware module that corrects the classification probabilities of different detected boxes in a box-specific way. Without bells and whistles, extensive experiments on MS COCO show that the proposed KROD improves different baseline models (both anchor-based and anchor-free) by 1.2% to 1.8% AP with a marginal loss of efficiency.
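
As a rough illustration of the CRM idea described above, and not the authors' implementation, the sketch below builds a category co-occurrence graph from per-image label sets and applies one graph-convolution step to raw per-box class scores. The function names, the row normalization with self-loops, the identity weight matrix, and the toy data are all assumptions made for this example; the abstract does not specify any of them.

```python
import numpy as np


def cooccurrence_adjacency(label_sets, num_classes):
    """Row-normalized category co-occurrence matrix with self-loops.

    label_sets: iterable of per-image category-index lists (hypothetical input).
    """
    counts = np.zeros((num_classes, num_classes))
    for labels in label_sets:
        unique = sorted(set(labels))
        for i in unique:
            for j in unique:
                if i != j:
                    counts[i, j] += 1.0
    adj = counts + np.eye(num_classes)  # self-loops keep each node's own signal
    return adj / adj.sum(axis=1, keepdims=True)


def gcn_layer(adj, features, weight):
    """One graph convolution: aggregate over neighbors, project, apply ReLU."""
    return np.maximum(adj @ features @ weight, 0.0)


# Toy annotations: categories 0 and 1 co-occur twice, 1 and 2 once.
label_sets = [[0, 1], [0, 1], [1, 2]]
adj = cooccurrence_adjacency(label_sets, num_classes=3)

# Raw class scores for two detected boxes, shape (num_boxes, num_classes).
box_scores = np.array([[0.90, 0.05, 0.05],
                       [0.10, 0.20, 0.70]])

# Graph nodes are categories, so scores are transposed to (num_classes, num_boxes).
# The identity weight is a placeholder for a learned projection.
refined = gcn_layer(adj, box_scores.T, np.eye(box_scores.shape[0])).T
print(refined.shape)
```

In this toy setting, each refined score is a co-occurrence-weighted mix of the box's original category scores. A trained model would learn the projection weights and, per the abstract, combine the result with the loss-aware correction.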

Journal: Expert Systems with Applications
Publication Date: Aug 2, 2022
Citations: 3
