Searching for Fast Demosaicking Algorithms

Karima Ma, Shoaib Kamil, Connelly Barnes, Michael Gharbi, Tzu-Mao Li, Andrew Adams, Jonathan Ragan-Kelley

doi:10.1145/3508461

Abstract

We present a method to automatically synthesize efficient, high-quality demosaicking algorithms, across a range of computational budgets, given a loss function and training data. It performs a multi-objective, discrete-continuous optimization which simultaneously solves for the program structure and parameters that best trade off computational cost and image quality. We design the method to exploit domain-specific structure for search efficiency. We apply it to several tasks, including demosaicking both Bayer and Fuji X-Trans color filter patterns, as well as joint demosaicking and super-resolution. In a few days on 8 GPUs, it produces a family of algorithms that significantly improves image quality relative to the prior state-of-the-art across a range of computational budgets from 10s to 1000s of operations per pixel (1 dB–3 dB higher quality at the same cost, or 8.5–200× higher throughput at the same or better quality). The resulting programs combine features of both classical and deep learning-based demosaicking algorithms into more efficient hybrid combinations, which are bandwidth-efficient and vectorizable by construction. Finally, our method automatically schedules and compiles all generated programs into optimized SIMD code for modern processors.

Highlights

Demosaicking must often be performed under extreme computational budgets: a single stream of 4K 60 FPS video requires processing 0.5 gigapixels per second.
Our programs are Pareto-dominant: they offer significantly higher quality (1 dB–3 dB) at the same computational cost as any prior algorithm in the same range, and they can deliver comparable or better image quality at dramatically lower computational cost (8.5–220× or more). They are designed for efficient streaming SIMD implementation, and automatically compile to highly optimized kernels for modern processors. We generate this family of new algorithms automatically by developing a multi-objective, discrete-continuous search which simultaneously solves for the program structure and parameters to find the best tradeoff between computational cost and image quality in a target range of computational budgets.
Our search process significantly improves the quality vs. performance tradeoff of existing programs in the real-time performance regime, and spans a frontier of state-of-the-art algorithms covering a throughput range of 10–100 megapixels per second on a single CPU core.
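
The 0.5 gigapixels per second figure can be checked with a few lines of arithmetic. This is a minimal sketch that assumes "4K" means the 3840×2160 UHD frame size, which the text does not state; the function name is illustrative only.

```python
def pixel_rate(width: int, height: int, fps: int) -> int:
    """Pixels that must be processed per second for one video stream."""
    return width * height * fps


# Assumption: 4K is taken to be UHD, 3840 x 2160 pixels per frame.
rate = pixel_rate(3840, 2160, 60)

print(f"{rate / 1e9:.3f} gigapixels per second")
print(f"{1e9 / rate:.2f} ns available per pixel")
```

Under that assumption the stream comes to roughly 0.5 gigapixels per second, which leaves about two nanoseconds of processing time per pixel.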

Summary

INTRODUCTION

Demosaicking is among the most ubiquitous and performance-critical image processing tasks. Our programs are Pareto-dominant: they offer significantly higher quality (1 dB–3 dB) at the same computational cost as any prior algorithm in the same range, and they can deliver comparable or better image quality at dramatically lower computational cost (8.5–220× or more). They are designed for efficient streaming SIMD implementation, and automatically compile to highly optimized kernels for modern processors. We generate this family of new algorithms automatically by developing a multi-objective, discrete-continuous search which simultaneously solves for the program structure and parameters to find the best tradeoff between computational cost and image quality in a target range of computational budgets. We define a search space that generates SIMD- and locality-friendly algorithms by construction, and a compiler that exploits this structure to automatically generate highly optimized streaming implementations.
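
To make "Pareto-dominant" concrete: one program dominates another if it costs no more and scores no worse, and is strictly better on at least one of the two. The sketch below filters a set of (cost, quality) pairs down to the non-dominated ones. It is an illustration of the concept only, not the paper's search procedure, and the candidate numbers are made up for the example.

```python
def pareto_front(candidates):
    """Return the non-dominated (cost, quality) pairs, sorted by cost.

    Lower cost is better; higher quality is better.
    """
    # Sort by cost ascending; for equal cost, put the best quality first.
    ordered = sorted(candidates, key=lambda c: (c[0], -c[1]))
    front = []
    best_quality = float("-inf")
    for cost, quality in ordered:
        # A point survives only if it beats every cheaper (or equal-cost) point.
        if quality > best_quality:
            front.append((cost, quality))
            best_quality = quality
    return front


# Hypothetical candidates: (operations per pixel, quality in dB).
candidates = [(10, 30.0), (20, 29.5), (50, 33.0),
              (50, 32.0), (200, 36.0), (300, 35.5)]
front = pareto_front(candidates)
print(front)
```

In this made-up set, the program at 20 operations per pixel is dropped because the one at 10 is both cheaper and higher quality; the survivors form the cost vs. quality frontier.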

RELATED WORK

Image Demosaicking

Super-resolution

Neural Architecture Search and Genetic Programming

TECHNIQUE

Building Blocks

Search Algorithm

Mutations

Fast Training

Efficiently Structured Search

Compiling Programs to Optimized Implementations

EVALUATION

Pareto-Dominant Programs

Bayer Demosaicking

X-Trans Demosaicking

Joint Bayer Demosaicking and Super-resolution

Ablation

Correlation Between Program Cost and Runtime

FUTURE WORK, LIMITATIONS, AND CONCLUSION

Journal: ACM Transactions on Graphics	Publication Date: May 13, 2022
Citations: 4	License type: cc-by
