Recent technological developments have enabled us to synthesize images and sounds concurrently within single computers, even in real time, giving birth to novel and genuinely integrated audiovisual art forms (Hunt et al. 1998). But how should we organize and compose such works? Given a certain soundscape, what sequence of images would be appropriate to it? Given a certain sequence of images, what soundscape is appropriate to it? And if the image sequence and the soundscape are created concurrently, how should we compose them? Authors have proposed different approaches to these questions (Whitney 1980; Hunt et al. 1998; Lokki et al. 1998; Rudi 1998; Kim and Lipscomb 2003; Gerhard and Hepting 2004; Yeo et al. 2004). These approaches differ significantly and are based on diverse principles, such as correspondence of aural to visual harmony, audiovisual modeling of mathematical principles, audiovisual rendering, data sonification, algorithmic control, and parameter-space exploration. It is important to note that there is no easy or uniquely correct solution, because the problem lies in combining two entirely different media in time (Hunt et al. 1998).

A fuzzy-logic approach to the challenge of composing both sound and moving image within a coherent framework is proposed here as an alternative solution. This approach is based on a fuzzy-logic model that enables flexible mapping of either aural or visual information onto the other, and it is able to generate complex audiovisual relationships by very simple means. The mapping strategy is inspired by two fundamental ideas: isomorphism and synaesthesia. Isomorphism applies when two complex structures can be mapped onto each other such that changes in one modality consistently cause changes in the other (Hofstadter 1999). The word synaesthesia comes directly from the Greek syn (together) and aisthesis (perceive; Van Campen 1999), thus meaning a union of the senses. Synaesthesia occurs when stimulation in one sensory modality automatically triggers a perception in a second modality, in the absence of any direct stimulation to that second modality (Harrison and Baron-Cohen 1997).

This article is structured as follows. First, the motivations for this work are presented and discussed, including discussions of audiovisual domains, synaesthesia, and isomorphism. Second, fuzzy logic and its main features are introduced. Third, details of the proposed fuzzy-logic mapper are presented. Fourth, ID-FUSIONES (2001) and TIME EXPOSURE (2005), computer music-video works that use the proposed model, are discussed as actual implementations of the approach described in this article. Finally, conclusions and directions for future work are addressed.
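Before the detailed presentation of the mapper, the general idea of fuzzy mapping between modalities can be sketched in a few lines. The following is a minimal illustrative sketch, not the article's actual model: it maps a single aural parameter (loudness, normalized to 0–1) onto a single visual parameter (brightness, 0–1) by fuzzifying the input with triangular membership functions, applying three simple rules, and defuzzifying with a weighted average. All membership shapes, rule outputs, and parameter names here are assumptions chosen for demonstration only.

```python
def triangular(x, a, b, c):
    """Degree of membership in a triangular fuzzy set peaking at b."""
    if x <= a or x >= c:
        return 0.0
    if x < b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)

def map_loudness_to_brightness(loudness):
    """Map a normalized loudness value to a brightness value via fuzzy rules."""
    # Fuzzify: degrees of membership in three overlapping loudness sets.
    quiet = triangular(loudness, -0.5, 0.0, 0.5)
    medium = triangular(loudness, 0.0, 0.5, 1.0)
    loud = triangular(loudness, 0.5, 1.0, 1.5)
    # Rules (illustrative): quiet -> dark (0.1), medium -> mid (0.5),
    # loud -> bright (0.9). Defuzzify with a weighted average of the
    # singleton rule outputs.
    num = quiet * 0.1 + medium * 0.5 + loud * 0.9
    den = quiet + medium + loud
    return num / den if den else 0.5

print(round(map_loudness_to_brightness(0.75), 3))  # prints 0.7
```

Because the membership sets overlap, intermediate loudness values activate several rules at once, so the output varies smoothly rather than switching abruptly between discrete visual states; this graded behavior is what makes fuzzy mapping attractive for relating continuous aural and visual parameters.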