Visual Summarization Research Articles

In recent years, we have witnessed the flourish of multimedia data on the Internet. To facilitate humans in accessing and managing the explosively growing multimedia contents, extensive research efforts have been dedicated to automatic multimedia analysis and processing in the past decades, such as categorization, annotation and indexing. However, despite great advances achieved, several key difficulties still exist, such as the well-known semantic gap in multimedia modeling. It is evident from recent results that, without additional information resources, most of the semantic gap problems can hardly be solved automatically within the near future. On the other hand, we have witnessed the power of collective human efforts in the Web 2.0 era in providing high-quality tags and comments to large amounts of images and videos in sites such as Flickr and YouTube. In fact, a lot more can be accomplished through simple online games such as the ESP. Hence, more and more researchers believe that a possible approach to addressing the semantic gap problem is to incorporate the efforts of humans into the computational process, i.e., by combining human intelligence and automated computer processing to jointly tackle the problems in a collaborative manner. The past decade has witnessed the increase of such efforts, such as relevance feedback in content-based image retrieval, active learning in multimedia modeling, the interactive video search evaluation task in TRECVID, new search and browsing interfaces in VideoOlympics to facilitate humans’ interaction, and the recent human computation efforts such as the ESP game on Google image search website. This special issue is organized with the purpose of introducing novel research work on interactive multimedia computing. Submissions have come from an open call for paper. With the assistance of dedicated referees, five papers have been selected after two rounds of rigorous reviews. These papers cover widely subtopics of interactive multimedia computing, including game-based image annotation, interactive TV, interactive cartoon synthesis, and so on. In the first paper ‘‘Adding Semantics to Image Region Annotations with the Name-It-Game’’, Steggink and Snoek introduce a system that accomplishes region-level image annotation with a game. It establishes a set of keywords that describe objects by exploring WordNet, and the keywords are assigned to image regions with a two-player ‘‘reveal and guess’’ game. They also explore WordNet to address the word ambiguity problem. In addition to introducing the system, another contribution of the paper is its review of existing manual image annotation techniques, in particular the comprehensive study of game-based annotation. In the second paper ‘‘Interactive Browsing via Diversified Visual Summarization for Image Search Results’’, Wang et al. introduce a scheme for the summarization and browsing of image search results. It adopts a dynamic absorbing random walk approach to summarize the image search results. The summarization is visualized on a 2D panel and users’ browsing is facilitated with dynamic scale change and a browsing path tracking tool. Experiments with a set of diverse queries have demonstrated the effectiveness of the approach. The third paper, ‘‘Security and Privacy Requirements in Interactive TV’’, discusses the security and privacy issues in the context of interactive TV. It introduces an interactive M. Wang (&) J. Tang T.-S. Chua National University of Singapore, Singapore, Singapore e-mail: eric.mengwang@gmail.com

Mining multimedia information in the Web is in general an arduous task, due to the fact that, (a) humans perceive media content using high level concepts, (b) the subjective and vagueness of content interpretation, and (c) the fact that relevant data are often hidden in a huge amount of irrelevant information. In addition, delivering and distributing the retrieved information to a wide range of terminal devices of different properties over a wide range of networks to users of different preferences requires new tools and mechanisms for content transformation and adaptation. Other problems concern the language that the data are stored, which may not be the user's preferred language. To address these issues we propose an integrated, reconfigurable, adaptable and open architecture for mining, indexing and retrieving multimedia information based on a mobile agent technology scheme. The proposed architecture consists of three integral subsystems: the acquisition module, responsible for searching and retrieving media data (both textual and visual), the transformation module, able to adapt and transform the mined information to other forms of representation, and the distribution module for delivering and adapting the retrieved data in terms of terminal devices, network channels and user's preferences. The system is based on a reconfigurable architecture which is able to dynamically and automatically update the system response to user's actual needs and preferences, by extending descriptor classes that are considered more relevant by the users. New innovative algorithms are presented in this paper both at each system module as well as in the system integration. The system supports efficient content adaptation mechanisms, textual and visual summarization (both sequential and hierarchical), automatic language translation, ontological representation, visual processing and web-based data mining. Experimental analysis on real-life web sites has been performed to test the efficiency of the proposed scheme and compare it with other approaches presented in the literature.

Visual Summarization Research Articles

Related Topics

Articles published on Visual Summarization

EventThread: Visual Summarization and Stage Analysis of Event Sequence Data.

Molecular Surface Maps.

ETGraph: A graph-based approach for visual analytics of eye-tracking data

Homophily in coauthorship networks of East European sociologists.

Visual summarization of image collections by fast RANSAC

Visualizing Complex Data With Embedded Plots

Designing a Ranking System for Product Search Engine Based on Mining UGC

Effective browsing of image search results via diversified visual summarization by clustering and refining clusters

Perception of Average Value in Multiclass Scatterplots

Learning Crowdsourced User Preferences for Visual Summarization of Image Collections

Generating Visual Summaries of Geographic Areas Using Community-Contributed Images

Interactive multimedia computing

MI-MERCURY: A mobile agent architecture for ubiquitous retrieval and delivery of multimedia information

Web-based Digital Video Sequencing System

Content-based access to video objects: Temporal Segmentation, visual summarization, and feature extraction

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Visual Summarization Research Articles

Related Topics

Articles published on Visual Summarization

EventThread: Visual Summarization and Stage Analysis of Event Sequence Data.

Molecular Surface Maps.

ETGraph: A graph-based approach for visual analytics of eye-tracking data

Homophily in coauthorship networks of East European sociologists.

Visual summarization of image collections by fast RANSAC

Visualizing Complex Data With Embedded Plots

Designing a Ranking System for Product Search Engine Based on Mining UGC

Effective browsing of image search results via diversified visual summarization by clustering and refining clusters

Perception of Average Value in Multiclass Scatterplots

Learning Crowdsourced User Preferences for Visual Summarization of Image Collections

Generating Visual Summaries of Geographic Areas Using Community-Contributed Images

Interactive multimedia computing

MI-MERCURY: A mobile agent architecture for ubiquitous retrieval and delivery of multimedia information

Web-based Digital Video Sequencing System

Content-based access to video objects: Temporal Segmentation, visual summarization, and feature extraction