Abstract

Salient object detection is used as a pre-process in many computer vision tasks (such as salient object segmentation, video salient object detection, etc.). When performing salient object detection, depth information can provide clues to the location of target objects, so effective fusion of RGB and depth feature information is important. In this paper, we propose a new feature information aggregation approach, weighted group integration (WGI), to effectively integrate RGB and depth feature information. We use a dual-branch structure to slice the input RGB image and depth map separately and then merge the results separately by concatenation. As grouped features may lose global information about the target object, we also make use of the idea of residual learning, taking the features captured by the original fusion method as supplementary information to ensure both accuracy and completeness of the fused information. Experiments on five datasets show that our model performs better than typical existing approaches for four evaluation metrics.

Highlights

  • In recent years, salient object detection (SOD) has attracted widespread interest; it aims to distinguish the most visually obvious objects or regions in a Manuscript received: 2020-08-19; accepted: 2020-11-19 given image

  • To exploit both RGB and depth information, we propose a novel feature integration method, weighted group integration (WGI), that can well employ each category of information

  • For the NJUD dataset, our model achieves the best performance on all four evaluation metrics

Read more

Summary

Introduction

Salient object detection (SOD) has attracted widespread interest; it aims to distinguish the most visually obvious objects or regions in a Manuscript received: 2020-08-19; accepted: 2020-11-19 given image. SOD has been applied to many fields, including content-based image editing [1,2,3,4], image and video compression [5], object segmentation and recognition [6,7,8,9,10], visual tracking [11,12,13], image retrieval [14, 15], etc. As depth cameras (such as Kinect, RealSense, etc.) began to be applied to computer vision, combining the use of depth information for salient object detection, namely RGB-D SOD, becomes a topic of interest

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call