Abstract

The census transform is a non-parametric local transform that is widely used in stereo matching. This transform encodes the structural information of a local patch into a binary code stream representing the relative intensity ordering of the pixels within the patch. Despite its high performance in stereo matching, the census transform often generates identical binary code streams for two different patches because it simply thresholds the pixels within the patch at the center pixel intensity. To overcome this problem, we introduce a quaternary census transform that encodes the local structural information into a quaternary code stream by employing both the relative intensity ordering and the minimum visibility threshold of the human eye known as the just-noticeable difference. Moreover, because the human eye activates different areas of the retina based on brightness, the patch size for the proposed quaternary census transform adaptively varies depending on the luminance of each pixel. Experimental results on well-known Middlebury stereo datasets prove that the proposed transform outperforms the other census transform-based methods in terms of the accuracy of stereo matching.

Highlights

  • Stereo matching is one of the most extensively studied topics in computer vision [1]–[3]

  • We introduce a quaternary census transform (QCT) that adopts two properties of the human visual system to solve the limitations of the conventional census transform and other census transform-based methods

  • Ji et al.: Quaternary Census Transform Based on the Human Visual System for Stereo Matching TABLE 1

Read more

Summary

Introduction

Stereo matching is one of the most extensively studied topics in computer vision [1]–[3]. To achieve high-performance of stereo matching, various similarity cost calculation methods have been proposed such as sum of absolute difference [6], relative gradient [7], normalized cross-correlation [8], Mahalanobis distance crosscorrelation [9], and census transform [10]. Among these methods, the census transform has been popularly used for stereo matching because it recorded the highest performance [11]. The census transform [10] summarizes the local image structure of a square patch as a binary code stream that

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call