Abstract

A novel method for detection of important scenes in baseball videos based on correlation maximization between heterogeneous modalities via bidirectional time lag aware deep multiset canonical correlation analysis (BiTl-dMCCA) is presented in this paper. The proposed method enables detection of important scenes by collaboratively using baseball videos and their corresponding tweets. The technical contributions of this paper are twofold. First, since there are time lags between not only “tweets and corresponding multiple previous events” but also “events and corresponding multiple following posted tweets”, the proposed method considers these bidirectional time lags. Specifically, the representation of such bidirectional time lags into the derivation of their covariance matrices is newly introduced. Second, the proposed method adopts textual, visual and audio features calculated from tweets and videos as multi-modal time series features. Important scenes are detected as abnormal scenes via anomaly detection based on a generative adversarial network using multi-modal features projected by BiTl-dMCCA. The proposed method does not need any training data with annotation. Experimental results obtained by applying the proposed method to actual baseball matches show the effectiveness of the proposed method.

Highlights

  • Many video distribution services have recently become popular due to the development of various network technologies and devices

  • We propose a method for detection of important scenes in baseball videos that considers bidirectional time lags

  • By focusing on the above characteristics, the proposed method detects important scenes as abnormal scenes via anomaly detection based on a generative adversarial network (GAN) [18] using multimodal features projected by BiTl-dMCCA

Read more

Summary

INTRODUCTION

Many video distribution services have recently become popular due to the development of various network technologies and devices. Various methods for detection of important scenes that can consider the opinions of viewers have been realized by using Twitter4 [7]–[9]. We propose a method for detection of important scenes in baseball videos that considers bidirectional time lags. The proposed method newly derives bidirectional time lag aware deep multiset canonical correlation analysis (BiTl-dMCCA), which is an extended version of dMCCA [17], to consider bidirectional time lags depending on events occurring and posted tweets. By focusing on the above characteristics, the proposed method detects important scenes as abnormal scenes via anomaly detection based on a generative adversarial network (GAN) [18] using multimodal features projected by BiTl-dMCCA. The proposed method detects important scenes using tweets posted by viewers and baseball videos.

MULTI-MODAL FEATURE EXTRACTION
CONSTRUCTION OF A GAN-BASED DETECTION
IMPORTANT SCENE DETECTION
PERFORMANCE EVALUATION
CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.