Abstract

<p indent=0mm>In view of the issue that current mainstream segmentation-based text detection methods is difficult to achieve high detection speed due to complex post-processing to ensure detection accuracy, a scene text detection method is proposed which applies pyramid attention network and position attention module. First, it adopts pyramid attention network to perform feature extraction and semantic segmentation. Meanwhile, it adopts position attention module in high-level features, which strengthens the weights of similar objects in the image to enhance the effect of text detection. Finally, it adopts a simple and effective post-processing algorithm to increase detection speed under the premise of high detection accuracy. Experimental results show that in Total-text datasets, using light-weight backbone network, the method has great advantages on detection speed, and while using deeper backbone network, the method achieves the state of the art result and has a 2.0% lead on detection accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.