Flattenet: A Simple and Versatile Framework for Dense Pixelwise Prediction

Xin Cai,Yi-Fei Pu

doi:10.1109/access.2019.2959640

Abstract

In this paper, we focus on devising a versatile framework for dense pixelwise prediction whose goal is to assign a discrete or continuous label to each pixel for an image. It is well-known that the reduced feature resolution due to repeated subsampling operations poses a serious challenge to Fully Convolutional Network (FCN) based models. In contrast to the commonly-used strategies, such as dilated convolution and encoder-decoder structure, we introduce the Flattening Module to produce high-resolution predictions without either removing any subsampling operations or building a complicated decoder module. In addition, the Flattening Module is lightweight and can be easily combined with any existing FCNs, allowing the model builder to trade off among model size, computational cost and accuracy by simply choosing different backbone networks. We empirically demonstrate the effectiveness of the proposed Flattening Module through competitive results in human pose estimation on MPII, semantic segmentation on PASCAL-Context and object detection on PASCAL VOC. We hope that the proposed approach can serve as a simple and strong alternative of current dominant dense pixelwise prediction frameworks.

Highlights

Many fundamental computer vision tasks can be formulated as a dense pixelwise prediction problem
We introduce a novel scheme to produce dense pixelwise predictions based on the proposed lightweight Flattening Module while avoiding either removing any subsampling operations or building a complex decoder module
FLATTENET we firstly present a general framework for addressing the dense pixelwise prediction problem, from which our specific instantiation is derived, and introduce the Flattening Module

Summary

Introduction

Many fundamental computer vision tasks can be formulated as a dense pixelwise prediction problem. There is a growing interest in reducing anchor boxes based object detection to a pixelwise prediction problem [12]–[15]. Deep learning methods, and in particular deep convolutional neural networks (DCNNs) based on the Fully Convolutional Network (FCN) framework [16], have achieved tremendous success in such dense pixelwise prediction tasks. It is well-known that the major issue for current FCN based models is the reduced feature

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 65	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Flattenet: A Simple and Versatile Framework for Dense Pixelwise Prediction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Image Semantic Segmentation Based on Dilated Convolution and Multi-Layer Feature Fusion
Jun Liu ... Guoyun Zhong
-
Jun Liu, et. al.Jun Liu ... Guoyun Zhong
28 May 2021
28 May 2021

Development of Semantic Segmentation Based on Deep Learning
Yang Zhao
Highlights in Science, Engineering and Technology | VOL. 34
Yang ZhaoYang Zhao
28 Feb 2023
Highlights in Science, Engineering and Technology | VOL. 34

A Review of Semantic Segmentation Based on Context Information
Wei Xu ... Lingjun Yang
-
Wei Xu, et. al.Wei Xu ... Lingjun Yang
01 Dec 2018
01 Dec 2018

Multi-evidence Filtering and Fusion for Multi-label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning
Weifeng Ge ... Yizhou Yu
-
Weifeng Ge, et. al.Weifeng Ge ... Yizhou Yu
01 Jun 2018
01 Jun 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Flattenet: A Simple and Versatile Framework for Dense Pixelwise Prediction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access