Abstract

Visual sensor network (VSN) requires a multi-focus image or video frame fusion technique involving focus measure computation in the DCT-domain to generate an all-in-focus image. Such techniques are implemented on resource-constrained on-board systems requiring hardware-friendly implementations. In this article, we first show that components of the Laplacian matrix are related to the discrete cosine transform (DCT) basis. The relation is that the eigenvalues of the Laplacian with proper boundary condition form the diagonal elements of the diagonal matrix generated by the DCT operation on the Laplacian. Exploiting this relation, we propose a focus measure which works on the DCT coefficients reflecting the spatial-domain Laplacian operation. Certain simplifications allow our focus measure computation through hardware-friendly integer multiplication and summation, where matrix multiplication involves just N scalar multiplications for an N × N 2D signal. Finally, we propose an approach which suitably fuses multi-focus images or video frames in DCT based image or video coding framework through detection of properly focused area and neighborhood consistency analysis. We show that our proposed approach is hardwarefriendly, computationally simple, and is fast enough for VSN. Through experimental results, we show that our approach outperforms the relevant state-of-the-art in multi-focus image fusion for VSN both quantitatively and subjectively. We also show that our approach is effective in comparison to the state-of-the-art and a few latest generic multi-focus image fusion techniques in terms of quantitative and subjective evaluations.

Highlights

  • Visual sensor network (VSN) is an intelligent system [1], [2] which generates huge sensory data from the environment through geographically distributed camera nodes and collaboratively processes [3] to send useful information for different applications related to surveillance, monitoring, etc. [4]–[6]

  • A simple multi-focus image fusion technique that works and generates an all-in-focus image/frame in discrete cosine transform (DCT) domain would be preferable for VSN, as it can be directly incorporated in the DCT based compression framework

  • Amin-Naji et al [22] proposed a multi-focus image fusion technique which fuses the images based on the geometric mean of the largest five eigenvalues obtained through the singular value decomposition (SVD) on the input blocks in DCT domain

Read more

Summary

INTRODUCTION

Visual sensor network (VSN) is an intelligent system [1], [2] which generates huge sensory data from the environment through geographically distributed camera nodes and collaboratively processes [3] to send useful information for different applications related to surveillance, monitoring, etc. [4]–[6]. A simple multi-focus image fusion technique that works and generates an all-in-focus image/frame in DCT domain would be preferable for VSN, as it can be directly incorporated in the DCT based compression framework. Amin-Naji et al [20] proposed the use of variance and energy of Laplacian response in DCT domain for multi-focus image fusion in VSN

LAPLACIAN RESPONSE THROUGH EFFICIENT DCT DOMAIN COMPUTATION
LAPLACIAN RESPONSE FOR A BLOCK BASED OPERATION
PROPOSED FOCUS MEASURE
COMPUTATION OF FOCUS MEASURE
ESTIMATION OF INITIAL DECISION MAP
REFINED DECISION MAP
EXPERIMENTAL RESULTS AND DISCUSSION
QUANTITATIVE EVALUATION
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call