The Nonlinear Statistics of High-Contrast Patches in Natural Images

Ann B Lee

doi:10.1023/a:1023705401078

Ann B Lee

Open Access

PDF Available

https://doi.org/10.1023/a:1023705401078

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Recently, there has been a great deal of interest in modeling the non-Gaussian structures of natural images. However, despite the many advances in the direction of sparse coding and multi-resolution analysis, the full probability distribution of pixel values in a neighborhood has not yet been described. In this study, we explore the space of data points representing the values of 3 × 3 high-contrast patches from optical and 3D range images. We find that the distribution of data is extremely “sparse” with the majority of the data points concentrated in clusters and non-linear low-dimensional manifolds. Furthermore, a detailed study of probability densities allows us to systematically distinguish between images of different modalities (optical versus range), which otherwise display similar marginal distributions. Our work indicates the importance of studying the full probability distribution of natural images, not just marginals, and the need to understand the intrinsic dimensionality and nature of the data. We believe that object-like structures in the world and the sensor properties of the probing device generate observations that are concentrated along predictable shapes in state space. Our study of natural image statistics accounts for local geometries (such as edges) in natural scenes, but does not impose such strong assumptions on the data as independent components or sparse coding by linear change of bases.

Highlights

A number of recent attempts have been made to describe the non-Gaussian statistics of natural images (Field, 1987; Ruderman and Bialek, 1994; Olshausen and Field, 1996; Huang and Mumford, 1999; Simoncelli, 1999b; Grenander and Srivastava, 2001)
The research in natural image statistics can roughly be divided into two related directions
There are studies of image statistics which try to find an “optimal” set of linear projections or basis functions in the state space defined by the image data (8 × 8 patches, for example, define a distribution in R64)

Summary

Introduction

A number of recent attempts have been made to describe the non-Gaussian statistics of natural images (Field, 1987; Ruderman and Bialek, 1994; Olshausen and Field, 1996; Huang and Mumford, 1999; Simoncelli, 1999b; Grenander and Srivastava, 2001). There are studies of image statistics which try to find an “optimal” set of linear projections or basis functions in the state space defined by the image data (8 × 8 patches, for example, define a distribution in R64). Our analysis is divided into three parts: In Section 4 we study the distribution of our data with respect to a Voronoi tessellation of the space of data points This first part, is a model-free first exploration of the state space of contrast-normalized patches.

Optical and Range Data Sets

Preprocessing

A First Exploration of the 7-Sphere

Optical Data

The Ideal Manifold of Edges

Density of Optical Data as a Function of the Surface Parameters

Range Data Comparison

Density as a Function of Distance to Nearest Binary Patch

Range Data

Distribution of Range Patches Across the 50 Binary Symmetry Classes

Summary and Conclusions

Findings