Top-down Attention Model Research Articles

Visual storytelling aims at producing a narrative paragraph for a given photo album automatically. It introduces more new challenges than individual image paragraph descriptions, mainly due to the difficulty in preserving coherent topics and in generating diverse phrases to depict the rich content of a photo album. Existing attention-based models that lack higher-level guiding information always result in a deviation between the generated sentence and the topic expressed by the image. In addition, these widely applied language generation approaches employing standard beam search tend to produce monotonous descriptions. In this work, a coherent visual storytelling (CoVS) framework is designed to address the above-mentioned problems. Specifically, in the encoding phase, an image sequence encoder is designed to efficiently extract visual features of the input photo album. Then, the novel parallel top-down visual and topic attention (PTDVTA) decoder is constructed via a topic-aware neural network, a parallel top-down attention model, and a coherent language generator. Concretely, visual attention focuses on the attributes and the relationships of the objects, while topic attention integrating a topic-aware neural network could improve the coherence of generated sentences. Eventually, a phrase beam search algorithm with <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$n$ </tex-math></inline-formula> -gram hamming diversity is further designed to optimize the expression diversity of the generated story. To justify the proposed CoVS framework, extensive experiments are conducted on the VIST dataset, which shows that CoVS can automatically generate coherent and diverse stories in a more natural way. Moreover, CoVS obtains better performance than state-of-the-art baselines on BLEU-4 and METEOR scores, while maintaining good CIDEr and ROUGH_L scores. The source code of this work can be found in <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://mic.tongji.edu.cn</uri> .

Impairments in attentional behaviors, including over-selectivity, under-selectivity, distractibility and difficulty in shift of attention, are widely reported in several developmental disorders, including autism. Uncharacteristic inhibitory to excitatory neuronal number ratio (IER) and abnormal synaptic strength levels in the brain are two broadly accepted neurobiological disorders observed in autistic individuals. These neurobiological findings are contrasting and their relation to the atypical attentional behaviors is not clear yet. In this paper, we take a computational approach to investigate the relation of imbalanced IER and abnormal synaptic strength to some well-documented spectrum of attentional impairments. The computational model is based on a modified version of a biologically plausible neural model of two competing minicolumns in IT cortex augmented with a simple model of top-down attention. Top-down attention is assumed to amplify (attenuates) attended (unattended) stimulus. The inhibitory synaptic strength parameter in the model is set such that typical attentional behavior is emerged. Then, according to related findings, the parameter is changed and the model's attentional behavior is considered. The simulation results show that, without any change in top-down attention, the abnormal inhibitory synaptic strength values--and IER imbalance- result in over-selectivity, under-selectivity, distractibility and difficulty in shift of attention in the model. It suggests that the modeled neurobiological abnormalities can be accounted for the attentional deficits. In addition, the atypical attentional behaviors do not necessarily point to impairments in top-down attention. Our simulations suggest that limited changes in the inhibitory synaptic strength and variations in top-down attention signal affect the model's attentional behaviors in the same way. So, limited deficits in the inhibitory strength may be alleviated by appropriate change in top-down attention biasing. Nevertheless, our model proposes that this compensation is not possible for very high and very low values of the inhibitory strength.

Top-down Attention Model Research Articles

Related Topics

Articles published on Top-down Attention Model

Coherent Visual Storytelling via Parallel Top-Down Visual and Topic Attention

Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation

Attention-based active visual search for mobile robots

A novel visual attention method for target detection from SAR images

Vulnerable Plaque Recognition Based on Attention Model with Deep Convolutional Neural Network.

Fabric defect detection using an integrated model of bottom-up and top-down visual attention

Priming of awareness or how not to measure visual awareness

A Top-down Attention Model Using Naive Bayesian Network

Top-down attention based on object representation and incremental memory for knowledge building and inference

The co-attention model for tiny activity analysis

Computational model of excitatory/inhibitory ratio imbalance role in attention deficit disorders

Selective attention-based novelty scene detection in dynamic environments

MODELING AUDITORY PATHWAY FOR INTELLIGENT INFORMATION ACQUISITION

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Top-down Attention Model Research Articles

Related Topics

Articles published on Top-down Attention Model

Coherent Visual Storytelling via Parallel Top-Down Visual and Topic Attention

Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation

Attention-based active visual search for mobile robots

A novel visual attention method for target detection from SAR images

Vulnerable Plaque Recognition Based on Attention Model with Deep Convolutional Neural Network.

Fabric defect detection using an integrated model of bottom-up and top-down visual attention

Priming of awareness or how not to measure visual awareness

A Top-down Attention Model Using Naive Bayesian Network

Top-down attention based on object representation and incremental memory for knowledge building and inference

The co-attention model for tiny activity analysis

Computational model of excitatory/inhibitory ratio imbalance role in attention deficit disorders

Selective attention-based novelty scene detection in dynamic environments

MODELING AUDITORY PATHWAY FOR INTELLIGENT INFORMATION ACQUISITION