Extracting semantic video objects

Fuhui Long, Wan-Chi Siu, Hanchuan Peng, Dagan Feng

doi:10.1109/38.895132

Open Access

Abstract

We present an accurate, user-interactive system for extracting semantic video objects (SVOs). Like earlier work, it obtains an SVO with an accurate boundary by integrating temporal and spatial information, but it differs in when that integration takes place. Instead of fusing spatial and temporal segmentations on the first frame or on every frame of a video sequence, our system performs spatial segmentation, temporal segmentation, and their fusion adaptively, only when necessary. To do so, it detects the variation between successive frames: when a large variation occurs, it fuses the spatial and temporal segmentations; otherwise, it tracks the boundary of the previous SVO. We find that this simple method is both fast and accurate. Because temporal segmentation, spatial segmentation, spatio-temporal fusion, and boundary tracking all employ simple algorithms, the system has low computational complexity.
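
The adaptive decision described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how variation is measured, so the mean absolute intensity difference, the threshold value, the "init" label for the first frame, and the names `frame_variation` and `plan_actions` are all assumptions made for illustration.

```python
from typing import List, Sequence

# A grayscale frame as rows of pixel intensities (assumed representation).
Frame = Sequence[Sequence[int]]


def frame_variation(prev: Frame, curr: Frame) -> float:
    """Mean absolute intensity difference between two equally sized frames.

    This measure is an assumption; the abstract does not define "variation".
    """
    total = 0
    count = 0
    for row_prev, row_curr in zip(prev, curr):
        for a, b in zip(row_prev, row_curr):
            total += abs(a - b)
            count += 1
    return total / count if count else 0.0


def plan_actions(frames: List[Frame], threshold: float = 10.0) -> List[str]:
    """Choose one action per frame.

    "init"  : first frame, where the object is assumed to be set up
              with user interaction (hypothetical label).
    "fuse"  : large variation, so run spatial and temporal segmentation
              and fuse the results.
    "track" : small variation, so track the previous SVO boundary.
    The threshold is a hypothetical tuning parameter.
    """
    actions = []
    for i, frame in enumerate(frames):
        if i == 0:
            actions.append("init")
        elif frame_variation(frames[i - 1], frame) > threshold:
            actions.append("fuse")
        else:
            actions.append("track")
    return actions


# Usage: a nearly static frame is tracked, an abrupt change triggers fusion.
still = [[10, 10], [10, 10]]
slight = [[11, 10], [10, 12]]
jump = [[200, 200], [10, 10]]
print(plan_actions([still, slight, jump]))
```

The sketch only shows the control flow: the costlier segmentation and fusion step runs on frames where the content changes noticeably, and the cheaper boundary tracking handles the rest.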
