Abstract

The video summarization plays a momentous role for real-time surveillance for detection of suspicious human activities in public scenes. However, efficient, and accurate human-centric video summarization in real-time videos is still a challenging issue due to several inconsistencies such as noise, illumination effects, scale, or rotational variance, etc. Though, several prominent research studies are available related to human activities detection but a robust human-centric video summarization approach is a major issue. In this research, we expound an efficient video summary generation model (ReHuSum) that creates a short summary with human. Our approach employs the conception of domain adaptation for efficient summarization via transferring the knowledge of a potent pre-trained benchmark Inception-v3 model. Additionally, a frame differencing redundancy removal technique is adopted that aids in generating a precise and accurate video summary with human. Our model generates efficient short summaries from real-time videos with an overall human detection accuracy of 99.97%. Besides, we present two novel qualitative metrics namely; summarization rate (Summrate) and summarization loss (Summloss) for assessing the efficacy of overall summarization. The approach generates a real-time human-based summary of 46s with a (Summrate) of 98.7% and (Summloss) of 0.013 for an outdoor CCTV with a duration of 25.2 min. Similarly, an indoor CCTV of 40.10 min duration is summarized to 44s at a (Summrate) of 96.5% and (Summloss) of 0.035. The model can be utilized in real-time security applications (i.e., border or home surveillance, critical infrastructure, smart cities, etc.) to generate accurate video summaries with human.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call