Abstract

Saliency detection has found many applications. This paper proposes a 2D hidden Markov model (2D-HMM) that exploits the hidden semantic information of an image to detect its salient regions. A spatial pyramid Histogram of Oriented Gradients (SP-HOG) descriptor is used to extract features. After the image is encoded with a learned dictionary, the 2D Viterbi algorithm is applied to infer the saliency map. The model can depict the shapes of targets and is robust to changes in their posture and viewpoint. To validate the model against the human visual search mechanism, an eye-tracking experiment is conducted and the model is trained directly on the eye data. The results show that our model achieves better performance than the eye data. Moreover, they indicate that it is possible to infer observers’ targets by learning from eye-tracking data.
