Abstract
AbstractThe exponential increase in video consumption has created new difficulties for browsing and navigating through video more effectively and efficiently. Researchers are interested in video summarization because it offers a brief but instructive video version that helps users and systems save time and effort when looking for and comprehending relevant content. Key frame extraction is a method of video summarization that only chooses the most important frames from a given video. In this article, a novel supervised learning method ‘TC‐CLSTM Auto Encoder with Mode‐based Learning’ using temporal and spatial features is proposed for automatically choosing keyframes or important sub‐shots from videos. The method was able to achieve an average F‐score of 84.35 on TVSum dataset. Extensive tests on benchmark data sets show that the suggested methodology outperforms state‐of‐the‐art methods.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.