Text detection in natural scenes plays a key role in recognition and understanding of text in computer vision, while providing the basic possibilities for combining with Natural Language Processing (NLP). The ability of detection for small text and adjacent text is weak although the text detection in natural scenes has evolved from the horizontal text detection initially to the arbitrary shape text detection. Meanwhile, the environment of natural scenes is complex and changeable, which is a huge challenge for text detection. In this paper, we discuss and compare the horizontal and multi-directional detection methods based on target detection, as well as the curved and arbitrary shape detection methods based on the instance segmentation and semantic segmentation. The algorithm ideas are described and their advantages and disadvantages are analysed. Moreover, we found two research routes of curved text and arbitrary shape text detection known as Top-Down and Bottom-Up. According to the evaluations, PMTD [20] performs best in multi-directional text detection, while SPCNet [16] wins in the curve and arbitrary shape text detection.