One For All: A Mutual Enhancement Method for Object Detection and Semantic Segmentation

Shichao Zhang,Libo Sun,Wenhu Qin,Zhe Zhang

doi:10.3390/app10010013

Abstract

Generally, most approaches using methods such as cropping, rotating, and flipping achieve more data to train models for improving the accuracy of detection and segmentation. However, due to the difficulties of labeling such data especially semantic segmentation data, those traditional data augmentation methodologies cannot help a lot when the training set is really limited. In this paper, a model named OFA-Net (One For All Network) is proposed to combine object detection and semantic segmentation tasks. Meanwhile, using a strategy called “1-N Alternation” to train the OFA-Net model, which can make a fusion of features from detection and segmentation data. The results show that object detection data can be recruited to better the segmentation accuracy performance, and furthermore, segmentation data assist a lot to enhance the confidence of predictions for object detection. Finally, the OFA-Net model is trained without traditional data augmentation methodologies and tested on the KITTI test server. The model works well on the KITTI Road Segmentation challenge and can do a good job on the object detection task.

Highlights

In recent years, convolutional networks (ConvNets) contributed a lot to the dramatic improvements in computer vision-related tasks
This paper proposes a model called One for All Network (OFA-Net) (One For All, which means One model For All results required) to do driving environment images road segmentation and object detection tasks
This paper shows that by mixing object detection data with segmentation data using our “1-N Alternation” strategy, this unified multi-task learning [29] model can be trained faster, more accurate, with better generalization ability for the road segmentation task and high prediction confidence for the object detection task

Summary

Introduction

Convolutional networks (ConvNets) contributed a lot to the dramatic improvements in computer vision-related tasks. Zeiler and Fergus [27] demonstrated that the features learned by ConvNets are hierarchical, while the bottom layers focus on low-level features like corners, edges, etc., the top layers pay more attention to high-level features Inspired by this idea, this paper proposes a model called OFA-Net (One For All, which means One model For All results required) to do driving environment images road segmentation and object detection tasks. The model consists of three parts serving as feature extractor, detection, and segmentation, respectively It feeds object detection and semantic segmentation data alternately, and uses two different loss functions to train each task, respectively. Strategy are speeding up the convergence, improving segmentation accuracy and enhancing prediction confidence for object detection

Related Work

Transfer Learning and Multi-Task Learning

Simultaneous Detection and Segmentation

Initialization

Loss Functions and Loss Value Balancing

Alternate Training Strategy

Dataset Split and Experiments

Hyper Parameters

How Does Detection Affect Segmentation?

How Does Segmentation affect Detection?

OFA-Net Results

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Dec 18, 2019
Citations: 14	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

One For All: A Mutual Enhancement Method for Object Detection and Semantic Segmentation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Joint Multiclass Object Detection and Semantic Segmentation for Autonomous Driving
Shakhboz Abdigapporov ... Hakil Kim
IEEE Access | VOL. 11
Shakhboz Abdigapporov, et. al.Shakhboz Abdigapporov ... Hakil Kim
01 Jan 2023
IEEE Access | VOL. 11

Learning to capture dependencies between global features of different convolution layers
Zhangwei Li ... Guijun Zhang
Journal of Visual Communication and Image Representation | VOL. 81
Zhangwei Li, et. al.Zhangwei Li ... Guijun Zhang
30 Oct 2021
Journal of Visual Communication and Image Representation | VOL. 81

A loss-balanced multi-task model for simultaneous detection and segmentation
Wenwen Zhang ... Fei-Yue Wang
Neurocomputing | VOL. 428
Wenwen Zhang, et. al.Wenwen Zhang ... Fei-Yue Wang
28 Nov 2020
Neurocomputing | VOL. 428

Road Semantic Segmentation and Traffic Object Detection Model Based on Encoder-Decoder CNN Architecture
Yih-Chen Wang ... Yen-Lin Chen
-
Yih-Chen Wang, et. al.Yih-Chen Wang ... Yen-Lin Chen
06 Jul 2022
06 Jul 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

One For All: A Mutual Enhancement Method for Object Detection and Semantic Segmentation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences