Pixel-wise Prediction Research Articles

Overview

100 Articles

Published in last 50 years

Articles published on Pixel-wise Prediction

99 Search results

Prototype-Based Semantic Segmentation.

Deep learning based semantic segmentation solutions have yielded compelling results over the preceding decade. They encompass diverse network architectures (FCN based or attention based), along with various mask decoding schemes (parametric softmax based or pixel-query based). Despite the divergence, they can be grouped within a unified framework by interpreting the softmax weights or query vectors as learnable class prototypes. In light of this prototype view, we reveal inherent limitations within the parametric segmentation regime, and accordingly develop a nonparametric alternative based on non-learnable prototypes. In contrast to previous approaches that entail the learning of a single weight/query vector per class in a fully parametric manner, our approach represents each class as a set of non-learnable prototypes, relying solely upon the mean features of training pixels within that class. The pixel-wise prediction is thus achieved by nonparametric nearest prototype retrieving. This allows our model to directly shape the pixel embedding space by optimizing the arrangement between embedded pixels and anchored prototypes. It is able to accommodate an arbitrary number of classes with a constant number of learnable parameters. Through empirical evaluation with FCN based and Transformer based segmentation models (i.e., HRNet, Swin, SegFormer, Mask2Former) and backbones (i.e., ResNet, HRNet, Swin, MiT), our nonparametric framework shows superior performance on standard segmentation datasets (i.e., ADE20 K, Cityscapes, COCO-Stuff), as well as in large-vocabulary semantic segmentation scenarios. We expect that this study will provoke a rethink of the current de facto semantic segmentation model design.

IEEE transactions on pattern analysis and machine intelligence

Oct 1, 2024
Tianfei Zhou + 1

Editage

Paperpal

R Discovery

Mind the Graph

Pixel-wise Prediction Research Articles

Related Topics

Articles published on Pixel-wise Prediction

Prototype-Based Semantic Segmentation.

Exploring Generalizable Distillation for Efficient Medical Image Segmentation.

Information-Theoretic Exploration for Adaptive Robotic Grasping in Clutter Based on Real-Time Pixel-Level Grasp Detection

FCSN: Global Context Aware Segmentation by Learning the Fourier Coefficients of Objects in Medical Images.

Fine-Grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection.

Processing of electrical resistivity tomography data using convolutional neural network in ERT-NET architectures

Faulty-Feeder Detection for Single Phase-to-Ground Faults in Distribution Networks Based on Waveform Encoding and Waveform Segmentation

Graph isomorphism U-Net

A Robust Pixel-Wise Prediction Network With Applications to Industrial Robotic Grasping

PL-GNet: Pixel Level Global Network for detection and localization of image forgeries

Haar wavelet downsampling: A simple but effective downsampling module for semantic segmentation

LeNo: Adversarial Robust Salient Object Detection Networks with Learnable Noise

AN END-TO-END DEEP LEARNING WORKFLOW FOR BUILDING SEGMENTATION, BOUNDARY REGULARIZATION AND VECTORIZATION OF BUILDING FOOTPRINTS

Annotation-efficient learning for OCT segmentation.

Challenges and implications of predicting the spatiotemporal distribution of dengue fever outbreak in Chinese Taiwan using remote sensing data and deep learning

ActFloor-GAN: Activity-Guided Adversarial Networks for Human-Centric Floorplan Design.

Adaptive Perspective Distillation for Semantic Segmentation.

Dense FixMatch: a simple semi-supervised learning method for pixel-wise prediction tasks

SDANet: Semantic-Embedded Density Adaptive Network for Moving Vehicle Detection in Satellite Videos.

Road Topology Extraction From Satellite Imagery by Joint Learning of Nodes and Their Connectivity