Multimodal Semantics Research Articles

User-generated video content has grown tremendously fast to the point of outpacing professional content creation. In this work we develop methods that analyze contextual information of multiple user-generated videos in order to obtain semantic information about public happenings (e.g., sport and live music events) being recorded in these videos. One of the key contributions of this work is a joint utilization of different data modalities, including such captured by auxiliary sensors during the video recording performed by each user. In particular, we analyze GPS data, magnetometer data, accelerometer data, video- and audio-content data. We use these data modalities to infer information about the event being recorded, in terms of layout (e.g., stadium), genre, indoor versus outdoor scene, and the main area of interest of the event. Furthermore we propose a method that automatically identifies the optimal set of cameras to be used in a multicamera video production. Finally, we detect the camera users which fall within the field of view of other cameras recording at the same public happening. We show that the proposed multimodal analysis methods perform well on various recordings obtained in real sport events and live music performances.

A patient suffering from semantic dementia is described who consistently demonstrated the preserved ability to support specific types of semantic judgements from visual, but not from verbal, input. In addition the representations accessed from visual input were found to trigger complex behavioural schemata, while with verbal materials the patient performed almost invariably at chance level. A preliminary description is given of the nature of visual semantic representations, and the privileged relationship between this modality of input and some aspects of semantic knowledge is also explored. The richness of the semantic representations accessed from visual input can be accommodated within the “Multimodal Semantics” framework; alternative views, derived from the Identification Semantics and the Organized Unitary Content Hypothesis, are also discussed.

Multimodal Semantics Research Articles

Related Topics

Articles published on Multimodal Semantics

"That thing in New York": Impaired naming vs. preserved recognition of unique entities following an anterior temporal lobe lesion

Toward a pluri-component, multimodal, and dynamic organization of the ventral semantic stream in humans: lessons from stimulation mapping in awake patients

Multimodal Semantics Extraction from User-Generated Videos

Modality-Specific Operations in Semantic Dementia

Prior and rennie on times and tenses

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multimodal Semantics Research Articles

Related Topics

Articles published on Multimodal Semantics

"That thing in New York": Impaired naming vs. preserved recognition of unique entities following an anterior temporal lobe lesion

Toward a pluri-component, multimodal, and dynamic organization of the ventral semantic stream in humans: lessons from stimulation mapping in awake patients

Multimodal Semantics Extraction from User-Generated Videos

Modality-Specific Operations in Semantic Dementia

Prior and rennie on times and tenses