Abstract
In movies and TV shows, it is common that several scenes repeat alternately. These videos are characterized with the long-term temporal correlation, which can be exploited to improve video coding efficiency. However, in applications supporting random access (RA), a video is typically divided into a number of RA segments (RASs) by RA points (RAPs), and different RASs are coded independently. In such a way, the long-term temporal correlation among RASs with similar scenes cannot be used. We present a scene-library-based video coding scheme for the coding of videos with repeated scenes. First, a compact scene library is built by clustering similar scenes and extracting representative frames in encoding video. Then, the video is coded using a layered scene-library-based coding structure, in which the library frames serve as long-term reference frames. The scene library is not cleared by RAPs so that the long-term temporal correlation between RASs from similar scenes can be exploited. Furthermore, the RAP frames are coded as interframes by only referencing library frames so as to improve coding efficiency while maintaining RA property. Experimental results show that the coding scheme can achieve significant coding gain over state-of-the-art methods.
Highlights
In video coding, the key to achieve high video coding efficiency is to make full use of correlations in video
The scene library is constructed by extracting the center frame of each cluster
The proposed scheme is capable of exploiting long-term temporal correlation between RA segments (RASs) which belong to similar scenes
Summary
The key to achieve high video coding efficiency is to make full use of correlations in video. The shortterm temporal correlation has been well exploited by current video coding. A substantial part of videos is characterized with long-term temporal correlation since they contain scenes that appear repeatedly. In news programs, the scenes of studio and logo clips may emerge at intervals. The images of the hosts, the guests, and the audience repeat alternately. In movies and TV series, many scenes in dialogue clips and flashback episodes appear repeatedly. The video coding efficiency would be highly improved if the long-term correlation is well exploited
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have