Abstract

The main objective of this research paper is to design a system which would generate multimodal, nonparametric Bayesian model, and multilayered probability latent semantic analysis (pLSA)-based visual dictionary (BM-MpLSA). Advancement in technology and the exuberance of sports lovers have necessitated a requirement for automatic action recognition in the live video seed of sports. The fundamental requirement for such model is the creation of visual dictionary for each sports domain. This multimodal nonparametric model has two novel co-occurrence matrix creation—one for image feature vector and the other for textual entities. This matrix provides a basic scaling parameter for the unobserved random variables, and it is an extension of multilayered pLSA-based visual dictionary creation. This paper precisely concentrates on the creation of visual dictionary for Basketball. From the sports event images, the feature vector extracted is modified as SIFT and MPEG 7’s-based dominant color, color layout, scalable color and edge histograms. After quantization and analysis of these vector values, the visual vocabulary would be created by integrating them into the domain specific visual ontology for semantic understanding. The accuracy rate of this work is compared with respect to the action held on image based on performance.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.