Abstract

We introduce an automatic segmentation framework that blends the advantages of color-, texture-, shape-, and motion-based segmentation methods in a computationally feasible way. A spatiotemporal data structure is first constructed for each group of video frames, in which each pixel is assigned a feature vector based on low-level visual information. Then, the smallest homogeneous components, so-called volumes, are expanded from selected marker points using an adaptive, three-dimensional, centroid-linkage method. Self descriptors that characterize each volume and relational descriptors that capture the mutual properties between pairs of volumes are determined by evaluating the boundary, trajectory, and motion of the volumes. These descriptors are used to measure the similarity between volumes based on which volumes are further grouped into objects. A fine-to-coarse clustering algorithm yields a multiresolution object tree representation as an output of the segmentation.

Highlights

  • Object segmentation is important for video compression standards as well as recognition, event analysis, understanding, and video manipulation

  • We selected a version of the proposed video object segmentation (VOS) framework to be used as a reference considering the computational simplicity, that is, texture features and motion parameters are omitted

  • Intra-inter switching method is preferred to prevent a volume from having disconnected regions

Read more

Summary

Introduction

Object segmentation is important for video compression standards as well as recognition, event analysis, understanding, and video manipulation. Segmentation techniques can be grouped into three classes: region-based methods using a homogeneous color or texture criterion, motion-based approaches utilizing a homogeneous motion criterion, and object tracking. Color-clusteringbased methods often utilize histograms and they are computationally simple. Histogram analysis delivers satisfactory segmentation result especially for multimodal color distributions, and where the input data set is relatively simple, clean, and fits the model well. Region-growing-based techniques provide better performance in terms of spatial connectivity and boundary accuracy than histogram-based methods. A common problem of histogram and region-based methods arises from the fact that a video object can contain several totally different colors

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.