GCCNet: Grouped channel composition network for scene text detection

Chang Liu,Chun Yang,Jie-Bo Hou,Long-Huang Wu,Xiao-Bin Zhu,Lei Xiao,Xu-Cheng Yin

doi:10.1016/j.neucom.2021.04.095

Abstract

Anchor mechanism is widely applied in scene text detection methods and demonstrates promising performance. However, existing anchor mechanisms have two major limitations, namely handcrafted anchor design and hard-wired anchor assignment. We propose a novel Grouped Channels Composition(GCC) block to achieve the data-driven anchor design and adaptive anchor assignment. To be more specific, our GCC block uses optimizable anchor functions rather than handcrafted ones to achieve data-drive anchor design. In our GCC block, an adaptive anchor assignment is achieved with the attention mechanism instead of empirically assigning anchor according to the Intersection Over Union (IoU) between ground truth and targets. We then build a corresponding network named GCCNet with our GCC blocks. We also propose a Unified Loss Weighting module to alleviate the inconsistency between classification score and localization accuracy. Experiments conducted on publicly available datasets demonstrate the state-of-the-art performance of our methods.

Full Text