Time-sync Comments Research Articles

Recent years have witnessed the booming of online social media platforms with embracing the popular service called “Time-Sync Comment”, which supports the viewers to share their time-sync opinions along with video content. In this way, we observe that numerous semantically-altered terms, or “Memes”, were created by niche users to express their unique ideas and emotions, and further attracted a large group of viewers with better activity and enthusiasm. Unfortunately, since the memes were created based on domain-specific knowledge and semantically varied depending on the multimodal context in videos, newcomers may fail to comprehend the semantic connotation of memes, which may severely impair their user-experiences. To deal with this issue, in this article, we propose a novel meme explanation framework, called ProMDE, to automatically capture and comprehend the memes in time-sync comments, which could further benefit the viewers with meme explanation service. Specifically, we first iteratively reconstruct the original time-sync comments compared with visual embedding to detect the semantically-altered terms as meme candidates. Afterward, based on the guides from the domain-specific corpus, visual and textual features will be fused to represent the context-aware multimodal cues. Moreover, to accurately describe the commonly-seen homophones in memes, i.e., they have the same pronunciation but different word-spelling expressions, we integrate the phonetic symbols as an additional modality to enhance the framework. Finally, we utilize a Transformer-based decoder to generate the natural language explanation for captured memes. Extensive experiments on a large real-world dataset prove that our framework could significantly outperform several state-of-the-art baseline methods, demonstrating the efficacy of modeling multimodal context and pronunciation for meme detection and explanation.

Read full abstract

With the rapid increase of mobile devices and online media, more and more people prefer posting/viewing videos online. Generally, these videos are presented on video streaming sites with image thumbnails and text titles. While facing huge amounts of videos, a viewer clicks through a certain video with high probability because of its eye-catching thumbnail. However, current video thumbnails are created manually, which is time-consuming and quality-unguaranteed. And static image thumbnails contain very limited information of the corresponding videos, which prevents users from successfully clicking what they really want to view. In this paper, we address a novel problem, namely GIF thumbnail generation, which aims to automatically generate GIF thumbnails for videos and consequently boost their Click-Through-Rate (CTR). Here, a GIF thumbnail is an animated GIF file consisting of multiple segments from the video, containing more information of the target video than a static image thumbnail. To support this study, we build the first GIF thumbnails benchmark dataset that consists of 1070 videos covering a total duration of 69.1 hours, and 5394 corresponding manually-annotated GIFs. To solve this problem, we propose a learning-based automatic GIF thumbnail generation model, which is called Generative Variational Dual-Encoder (GEVADEN). As not relying on any user interaction information (e.g. time-sync comments and real-time view counts), this model is applicable to newly-uploaded/rarely-viewed videos. Experiments on our built dataset show that GEVADEN significantly outperforms several baselines, including video-summarization and highlight-detection based ones. Furthermore, we develop a pilot application of the proposed model on an online video platform with 9814 videos covering 1231 hours, which shows that our model achieves a 37.5% CTR improvement over traditional image thumbnails. This further validates the effectiveness of the proposed model and the promising application prospect of GIF thumbnails.

Read full abstract

Time-sync Comments Research Articles

Related Topics

Articles published on Time-sync Comments

Fragment of Interest: Personalized Video Fragment Recommendation with Inter-Fragment & Intra-Fragment Contextual Effect

Sentiment classification of time-sync comments: A semi-supervised hierarchical deep learning method

Comprehending the Gossips: Meme Explanation in Time-Sync Video Comment via Multimodal Cues

Sentiment Analysis on Online Videos by Time-Sync Comments.

CoEvo-Net: Coevolution Network for Video Highlight Detection

Encrypted Live Streaming Channel Identification With Time-Sync Comments

GIF Thumbnails: Attract More Clicks to Your Videos

Time-sync comments denoising via graph convolutional and contextual encoding

Discovering attractive segments in the user-generated video streams

Time-Sync Video Tag Extraction Using Semantic Association Graph

Video clip recommendation model by sentiment analysis of time-sync comments

Stories That Big Danmaku Data Can Tell as a New Media

Bridging Video Content and Comments: Synchronized Video Description with Temporal Summarization of Crowdsourced Time-Sync Comments

Reading the Videos: Temporal Labeling for Crowdsourced Time-Sync Videos Based on Semantic Embedding

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Time-sync Comments Research Articles

Related Topics

Articles published on Time-sync Comments

Fragment of Interest: Personalized Video Fragment Recommendation with Inter-Fragment &amp; Intra-Fragment Contextual Effect

Sentiment classification of time-sync comments: A semi-supervised hierarchical deep learning method

Comprehending the Gossips: Meme Explanation in Time-Sync Video Comment via Multimodal Cues

Sentiment Analysis on Online Videos by Time-Sync Comments.

CoEvo-Net: Coevolution Network for Video Highlight Detection

Encrypted Live Streaming Channel Identification With Time-Sync Comments

GIF Thumbnails: Attract More Clicks to Your Videos

Time-sync comments denoising via graph convolutional and contextual encoding

Discovering attractive segments in the user-generated video streams

Time-Sync Video Tag Extraction Using Semantic Association Graph

Video clip recommendation model by sentiment analysis of time-sync comments

Stories That Big Danmaku Data Can Tell as a New Media

Bridging Video Content and Comments: Synchronized Video Description with Temporal Summarization of Crowdsourced Time-Sync Comments

Reading the Videos: Temporal Labeling for Crowdsourced Time-Sync Videos Based on Semantic Embedding

Fragment of Interest: Personalized Video Fragment Recommendation with Inter-Fragment & Intra-Fragment Contextual Effect