Abstract
The past 20 years have seen a progressive evolution of computer vision algorithms for unsupervised 2D image segmentation. While earlier efforts relied on Markov random fields and efficient optimization (graph cuts, etc.), the next wave of methods, beginning in the early part of this century, were in the main stovepiped. Of these 2D segmentation efforts, one of the most popular and, indeed, one that comes close to being a state-of-the-art method is the ultrametric contour map (UCM). The pipelined methodology consists of (i) computing local, oriented responses, (ii) graph creation, (iii) eigenvector computation (globalization), (iv) integration of local and global information, (v) contour extraction, and (vi) superpixel hierarchy construction. UCM performs well on a range of 2D tasks. Consequently, it is somewhat surprising that no 3D version of UCM exists at the present time. To address that lack, we present a novel 3D supervoxel segmentation method, dubbed 3D UCM, which closely follows its 2D counterpart while adding 3D-relevant features. The methodology, driven by supervoxel extraction, combines local and global gradient-based features to first produce a low-level supervoxel graph. Subsequently, an agglomerative approach is used to group supervoxel structures into a segmentation hierarchy with explicitly imposed containment of lower-level supervoxels in higher-level supervoxels. Comparisons are conducted against state-of-the-art 3D segmentation algorithms. The considered applications are 3D spatial and 2D spatiotemporal segmentation scenarios. For the latter comparisons, we present results of 3D UCM with and without optical flow video pre-processing. As expected, when motion correction beyond a certain range is required, we demonstrate that 3D UCM in conjunction with optical flow is a very useful addition to the pantheon of video segmentation methods.
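As a rough, hypothetical illustration of the six-stage pipeline named in the abstract, the sketch below wires the stages together with simple stand-in numpy operations. None of the stage bodies here reproduce the published implementation; only the ordering of the stages follows the text.

```python
import numpy as np

def ucm_pipeline_sketch(image, alpha=0.5):
    """Toy walk-through of the six UCM stages from the abstract.
    Each stage body is a stand-in, not the published method."""
    img = image.astype(float)
    # (i) local oriented responses -- stand-in: gradient magnitude
    gy, gx = np.gradient(img)
    mpb = np.hypot(gx, gy)
    # (ii) graph creation -- stand-in: dense affinity from boundary strength
    flat = mpb.ravel()
    W = np.exp(-np.abs(flat[:, None] - flat[None, :]))
    # (iii) globalization -- eigenvectors of the graph Laplacian
    D = np.diag(W.sum(axis=1))
    _, vecs = np.linalg.eigh(D - W)
    ev_img = vecs[:, 1].reshape(img.shape)  # second eigenvector as an "image"
    gy2, gx2 = np.gradient(ev_img)
    spb = np.hypot(gx2, gy2)
    # (iv) integrate local and global information
    gpb = alpha * mpb + (1 - alpha) * spb
    # (v)-(vi) contour extraction / hierarchy -- stand-in: nested thresholds
    hierarchy = [gpb > t for t in (0.25, 0.5, 0.75)]
    return gpb, hierarchy
```

The nested thresholded maps in the last step are only a placeholder for the oriented watershed transform and the agglomerative hierarchy of the actual method.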
Highlights
The ready availability of 3D datasets and video has opened up the need for more 3D computational tools
We introduce transfer learning to the 3D ultrametric contour map (UCM) without the intermediate supervised convolutional neural network (CNN) layers used in convolutional oriented boundaries (COB)
In 2D, the normalized cuts approach begins by constructing a sparse graph that connects pixels which are spatially close to each other. Globalized probability map UCM (gPb-UCM) [1] specifies a sparse symmetric affinity matrix W using the intervening contour cue [5]: the affinity Wij between pixels i and j is derived from the maximal value of mPb along the line connecting them
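The intervening contour cue can be sketched as follows: sample mPb along the line segment between the two pixels and map its maximum to an affinity via an exponential. The scale parameter `rho` and the exact line-sampling scheme are assumptions, not taken from the source.

```python
import numpy as np

def intervening_contour_affinity(mPb, i, j, rho=0.1):
    """Affinity between pixels i=(r0,c0) and j=(r1,c1): a strong
    boundary response anywhere on the line between them yields a
    low affinity. `rho` is an assumed scale parameter."""
    r0, c0 = i
    r1, c1 = j
    n = max(abs(r1 - r0), abs(c1 - c0)) + 1  # samples along the line
    rows = np.linspace(r0, r1, n).round().astype(int)
    cols = np.linspace(c0, c1, n).round().astype(int)
    max_pb = mPb[rows, cols].max()           # maximal mPb on the line
    return np.exp(-max_pb / rho)
```

Filling a sparse symmetric W then amounts to evaluating this affinity for every pair of pixels within a small spatial radius of each other.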
Summary
The ready availability of 3D datasets and video has opened up the need for more 3D computational tools. The top eigenvectors of the graph are extracted and placed in image coordinates, followed by gradient computation. This results in the sPb(x, y, θ) detector, which carries global information since it is derived from eigenvector "images." The globalized probability detector gPb(x, y, θ) is then computed via a weighted linear combination of mPb and sPb. While this completes the pipeline in terms of information accrued for segmentation, UCM proceeds to obtain a set of closed regions, using gPb as the input, via the application of the oriented watershed transform (OWT). Following recent work, we perform graph-based agglomeration using all voxels. With these changes to the pipeline, the 3D UCM framework is broadly subdivided into (i) local volume gradient detection, (ii) globalization using reduced-order eigensolvers, and (iii) graph-based agglomeration, reflecting the emphasis on the changed subsystems. The upside is that 3D UCM becomes scalable to sizable datasets
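The globalization step described above can be sketched in a few lines: recover the leading generalized eigenvectors of the graph Laplacian from the normalized affinity matrix, place each one back into image coordinates, and accumulate their gradient magnitudes into sPb. The 1/sqrt(lambda) weighting follows the gPb-UCM convention; the eigensolver choice and small-epsilon guard are assumptions.

```python
import numpy as np
from scipy.sparse import diags
from scipy.sparse.linalg import eigsh

def spectral_globalization(W, shape, n_vec=4):
    """Sketch of the globalization stage for a sparse symmetric
    affinity W over an image of the given shape. Eigenvectors are
    reshaped into "images" and their gradients summed into sPb."""
    d = np.asarray(W.sum(axis=1)).ravel()
    d_isqrt = diags(1.0 / np.sqrt(d))
    M = d_isqrt @ W @ d_isqrt                # normalized affinity
    mu, U = eigsh(M, k=n_vec + 1, which='LA')
    order = np.argsort(-mu)                  # mu[order[0]] ~ 1 (trivial)
    sPb = np.zeros(shape)
    for k in order[1:]:                      # skip the trivial eigenvector
        lam = max(1.0 - mu[k], 1e-12)        # Laplacian eigenvalue
        v = (d_isqrt @ U[:, k]).reshape(shape)
        gy, gx = np.gradient(v)
        sPb += np.hypot(gx, gy) / np.sqrt(lam)
    return sPb
```

The final detector would then be gPb = a * mPb + b * sPb for learned or hand-set weights a, b, with OWT applied downstream to close the contours.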
More From: IPSJ Transactions on Computer Vision and Applications