UniColor

Zhitong Huang,Jing Liao,Nanxuan Zhao

doi:10.1145/3550454.3555471

UniColor

Zhitong Huang, Jing Liao + Show 1 more

Open Access

https://doi.org/10.1145/3550454.3555471

Copy DOI

Abstract

We propose the first unified framework UniColor to support colorization in multiple modalities, including both unconditional and conditional ones, such as stroke, exemplar, text, and even a mix of them. Rather than learning a separate model for each type of condition, we introduce a two-stage colorization framework for incorporating various conditions into a single model. In the first stage, multi-modal conditions are converted into a common representation of hint points. Particularly, we propose a novel CLIP-based method to convert the text to hint points. In the second stage, we propose a Transformer-based network composed of Chroma-VQGAN and Hybrid-Transformer to generate diverse and high-quality colorization results conditioned on hint points. Both qualitative and quantitative comparisons demonstrate that our method outperforms state-of-the-art methods in every control modality and further enables multi-modal colorization that was not feasible before. Moreover, we design an interactive interface showing the effectiveness of our unified framework in practical usage, including automatic colorization, hybrid-control colorization, local recolorization, and iterative color editing. Our code and models are available at https://luckyhzt.github.io/unicolor .

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

UniColor

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Graphics

Lead the way for us

Journal: ACM Transactions on Graphics	Publication Date: Nov 30, 2022
Citations: 11

Similar Papers

A Novel Semi-Supervised Learning Approach in Artificial Olfaction for E-Nose Application
Lei Zhang ... Xin Yin
IEEE Sensors Journal | VOL. 16
Lei Zhang, et. al.Lei Zhang ... Xin Yin
01 Jun 2016
IEEE Sensors Journal | VOL. 16

Learning Semi-Supervised Representation Towards a Unified Optimization Framework for Semi-Supervised Learning
Chun-Guang Li ... Jun Guo
-
Chun-Guang Li, et. al.Chun-Guang Li ... Jun Guo
01 Dec 2015
01 Dec 2015

Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning
Ligong Han ... Hsin-Ying Lee
-
Ligong Han, et. al.Ligong Han ... Hsin-Ying Lee
01 Jun 2022
01 Jun 2022

Three-way parallel group independent component analysis: Fusion of spatial and spatiotemporal magnetic resonance imaging data.
Shile Qi ... Vince D Calhoun
Human brain mapping | VOL. 43
Shile Qi, et. al.Shile Qi ... Vince D Calhoun
22 Nov 2021
Human brain mapping | VOL. 43

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

UniColor

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Graphics