Abstract

Grayscale image colorization is known as an ill-posed problem because of the imbalanced matching between intensity and color values. Even given prior hints about the original color image, existing colorization methods cannot recover the original color image from grayscale faithfully. In this paper, we propose to embed color information into an invertible grayscale, such that it can be easily recovered to the original color. However, a vanilla encoding-decoding network cannot produce rich representations of color information and thus the reconstruction quality is limited. Moreover, due to the neglect of the discrimination of color information, it cannot embed color information into visually inconspicuous patterns located in the grayscale. In this paper, we propose a novel color-encoding schema, dual features ensemble network (DFENet), for the effective embedding and faithfully reconstruction. In particular, we complement the residual representations with dense representations, to integrate the ability of local residual learning and local feature fusion. Furthermore, we propose an element-wise self-attention mechanism that highlights the discriminative features and suppresses the redundant ones generated from the dual path module. Extensive experiments demonstrate the proposed method outperforms state-of-the-art methods in terms of reconstruction quality as well as the similarity between the generated invertible grayscale and its groundtruth.

Highlights

  • Color-to-gray conversion is widely applied to aesthetic stylization, monochrome printing and so on

  • We present to incorporate a dual features ensemble block in our dual features ensemble network (DFENet), which is constructed by a dual path module and an ensemble inference module

  • In order to capture rich color and contextual representations, we propose a novel color-encoding schema, dual features ensemble network (DFENet)

Read more

Summary

INTRODUCTION

Color-to-gray conversion is widely applied to aesthetic stylization, monochrome printing and so on. To distill features with affluent expressions of color information, the former one is composed of a series of assembled residual blocks and dense blocks in a dual path manner While the latter is introduced to investigate and exploit the implicit discriminative correlation between the generated features via a combined spatial-wise and channel-wise attention strategy. They use a vanilla U-net for embedding and decoding, preventing both the invertible grayscale and the reconstructed color image from a high image quality We address this problem by proposing a dual features ensemble network, achieving a high embedding and decoding performance. APPROACH we propose a novel method named DFENet, which is devoted to generate a grayscale image that can efficiently recover its original colors.

NETWORK ARCHITECTURE
DUAL FEATURES ENSEMBLE BLOCK
OPTIMIZATION FUNCTION
IMPLEMENTATION DETAILS
Findings
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call