Abstract

A new method for detecting important scenes in baseball videos via a time-lag-aware multimodal variational autoencoder (Tl-MVAE) is presented in this paper. Tl-MVAE estimates latent features from tweet, video, and audio features extracted from tweets and videos, and then detects important scenes by estimating, from these latent features, the probability that a scene is important. Notably, time-lags exist between the videos and the tweets users post about them. To account for the time-lags between tweet features and the other features calculated from the corresponding multiple previous events, a feature transformation based on feature correlations that considers such time-lags is newly introduced into the encoder of the MVAE; this is the biggest contribution of the Tl-MVAE. Experimental results obtained from actual baseball videos and their corresponding tweets show the effectiveness of the proposed method.
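The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustrative toy, not the paper's actual architecture: the cosine-similarity softmax weighting over previous events, the linear Gaussian encoder, and the logistic importance head are all assumptions standing in for the real (learned) Tl-MVAE components.

```python
import numpy as np

rng = np.random.default_rng(0)

def time_lag_weights(tweet_feat, event_feats):
    """Correlation-based weights over K previous events (hypothetical form)."""
    # cosine similarity between the tweet feature and each past event feature
    sims = np.array([
        float(tweet_feat @ e /
              (np.linalg.norm(tweet_feat) * np.linalg.norm(e) + 1e-8))
        for e in event_feats
    ])
    exp = np.exp(sims - sims.max())
    return exp / exp.sum()                 # softmax: weights sum to 1

def encode(x, W_mu, W_logvar):
    """Toy linear Gaussian encoder: x -> (mu, logvar) of the latent."""
    return W_mu @ x, W_logvar @ x

def reparameterize(mu, logvar):
    """Standard VAE reparameterization trick: z = mu + sigma * eps."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * logvar) * eps

# toy dimensions: feature size d, K previous events, latent size
d, K, latent = 8, 3, 4
tweet = rng.standard_normal(d)             # tweet feature
events = rng.standard_normal((K, d))       # video/audio features of K past events

w = time_lag_weights(tweet, events)        # time-lag-aware weights
fused = np.concatenate([tweet, w @ events])  # transformed multimodal input

W_mu = rng.standard_normal((latent, 2 * d)) * 0.1
W_logvar = rng.standard_normal((latent, 2 * d)) * 0.1
mu, logvar = encode(fused, W_mu, W_logvar)
z = reparameterize(mu, logvar)             # latent feature

# importance detector: logistic score on the latent feature (hypothetical head)
w_det = rng.standard_normal(latent)
p_important = 1.0 / (1.0 + np.exp(-(w_det @ z)))
print(f"importance probability: {p_important:.3f}")
```

In the actual method these maps would be neural networks trained jointly with the VAE reconstruction and KL losses; the sketch only shows where the time-lag-aware correlation weighting sits relative to the encoder and the detector.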

Highlights

  • We propose a new method for the detection of important scenes in baseball videos via a multimodal variational autoencoder (MVAE) that considers time-lags between tweets and corresponding multiple previous events

  • Since a feature transformation based on feature correlations that considers such time-lags is introduced into the time-lag-aware multimodal variational autoencoder (Tl-MVAE), the proposed method can derive latent features that are effective for modeling the relationships between tweets and videos

  • Our biggest contribution is the development of a method for the detection of important scenes in baseball videos via the Tl-MVAE, which can consider time-lags between tweets and their corresponding multiple previous events


Summary

Introduction

Publisher's Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

We propose a new method for the detection of important scenes in baseball videos via the MVAE that considers time-lags between tweets and corresponding multiple previous events. Since a feature transformation based on feature correlations that considers such time-lags is introduced into the Tl-MVAE, the proposed method can derive latent features that are effective for modeling the relationships between tweets and videos. Owing to this novelty, the proposed method can realize accurate detection based on the Tl-MVAE using multimodal features extracted from tweets and videos. We newly introduce the novel Tl-MVAE to the detection of important scenes.

Detection of Important Scenes via the Tl-MVAE
Encoder
Decoder
Important Scene Detector
Final Loss
Experimental Setting
Performance Evaluation
Conclusions