CRFNet: Context ReFinement Network used for semantic segmentation

Taeghyun An,Jungyu Kang,Dooseop Choi,Kyoung‐Wook Min

doi:10.4218/etrij.2023-0017

Taeghyun An, Jungyu Kang + Show 2 more

Open Access

https://doi.org/10.4218/etrij.2023-0017

Copy DOI

Abstract

AbstractRecent semantic segmentation frameworks usually combine low‐level and high‐level context information to achieve improved performance. In addition, postlevel context information is also considered. In this study, we present a Context ReFinement Network (CRFNet) and its training method to improve the semantic predictions of segmentation models of the encoder–decoder structure. Our study is based on postprocessing, which directly considers the relationship between spatially neighboring pixels of a label map, such as Markov and conditional random fields. CRFNet comprises two modules: a refiner and a combiner that, respectively, refine the context information from the output features of the conventional semantic segmentation network model and combine the refined features with the intermediate features from the decoding process of the segmentation model to produce the final output. To train CRFNet to refine the semantic predictions more accurately, we proposed a sequential training scheme. Using various backbone networks (ENet, ERFNet, and HyperSeg), we extensively evaluated our model on three large‐scale, real‐world datasets to demonstrate the effectiveness of our approach.

Full Text