TDAF: Top-Down Attention Framework for Vision Tasks

Bo Pang, Yizhuo Li, Jiefeng Li, Muchen Li, Hanwen Cao, Cewu Lu

doi:10.1609/aaai.v35i3.16339

Abstract

Human attention mechanisms often work in a top-down manner, yet this is not well explored in vision research. Here, we propose the Top-Down Attention Framework (TDAF), which captures top-down attention and can be easily adopted in most existing models. Its Recursive Dual-Directional Nested Structure forms two sets of orthogonal paths, recursive and structural, along which bottom-up spatial features and top-down attention features are extracted, respectively. Because these spatial and attention features are deeply nested, the proposed framework works in a mixed top-down and bottom-up manner. Empirical evidence shows that TDAF can capture effective stratified attention information and boost performance. ResNet with TDAF achieves a 2.0% improvement on ImageNet. For object detection, performance improves by 2.7% AP over FCOS. For pose estimation, TDAF improves the baseline by 1.6%. For action recognition, a 3D-ResNet adopting TDAF improves accuracy by 1.7%.
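
The abstract gives no layer-level detail, so the following is only a rough illustration of the general idea of mixing bottom-up feature extraction with top-down attention over several recursive steps. It is not the authors' Recursive Dual-Directional Nested Structure: the pooling, the sigmoid gating, and every function name here (bottom_up, top_down, run_sketch) are assumptions made for this sketch, written with NumPy on a single-channel map.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def downsample(x):
    """2x2 average pooling on an (H, W) map; H and W must be even."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def upsample(x):
    """Nearest-neighbour 2x upsampling of an (H, W) map."""
    return x.repeat(2, axis=0).repeat(2, axis=1)

def bottom_up(image, attention, num_levels):
    """Fine-to-coarse pass (hypothetical): gate each level by the
    attention map from the previous step, then pool to the next level."""
    feats = []
    x = image
    for level in range(num_levels):
        x = x * attention[level]
        feats.append(x)
        if level < num_levels - 1:
            x = downsample(x)
    return feats

def top_down(feats):
    """Coarse-to-fine pass (hypothetical): propagate coarse context
    downward and turn it into one attention map per level."""
    attention = [None] * len(feats)
    context = feats[-1]
    attention[-1] = sigmoid(context - context.mean())
    for level in range(len(feats) - 2, -1, -1):
        context = upsample(context) + feats[level]
        attention[level] = sigmoid(context - context.mean())
    return attention

def run_sketch(image, num_levels=3, num_steps=2):
    """Alternate the two passes for a fixed number of recursive steps."""
    h, w = image.shape
    # Start with all-ones attention, i.e. no gating on the first pass.
    attention = [np.ones((h >> l, w >> l)) for l in range(num_levels)]
    for _ in range(num_steps):
        feats = bottom_up(image, attention, num_levels)
        attention = top_down(feats)
    return feats, attention

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats, attention = run_sketch(rng.random((8, 8)))
    print([f.shape for f in feats])
```

In this toy version, the attention computed from coarse features in one step gates the fine features of the next step, which is one simple way to read "mixed top-down and bottom-up"; the paper's actual design should be taken from the full text.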

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: May 18, 2021
Citations: 4

Similar Papers

Phenomenal awareness can emerge without attention
Jaan Aru ... Talis Bachmann
Frontiers in Human Neuroscience | VOL. 7
01 Jan 2013

The Link between Visual Attention and the Subjective Perception of Time
M V Konstantinova ... L V Tereshchenko
Neuroscience and Behavioral Physiology | VOL. 49
01 Nov 2019

Interactions between Visual Attention and Episodic Retrieval: Dissociable Contributions of Parietal Regions during Gist-Based False Recognition
Scott A Guerin ... Daniel L Schacter
Neuron | VOL. 75
01 Sep 2012

Top-Down Neural Attention by Excitation Backprop
Jianming Zhang ... Stan Sclaroff
01 Jan 2015
