Abstract

Surveillance videos of public places often consist of group activities composed from multiple co-occurring individual activities. However, latent topic models, such as Latent Dirichlet Allocation (LDA), which have been successfully used to discover individual activities, do not discover group activities. In this paper we propose a method to discover group activities along with individual activities. We use a two layer latent structure where a latent variable is used to discover correlation of individual activities as a group activity using multinomial distribution. Each individual activity is in turn represented as a distribution over local visual features. We use a Gibbs sampling-based algorithm to jointly infer the individual and group activities. Our method can summarize not only the individual activities but also the common group activities in a video. We demonstrate the strength of our method by discovering activities and the salient correlation amongst them in real life videos of crowded public places.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.