Abstract

Problem definition: most of the existing works investigate the recognition of a fixed-length CAPTCHA, but the authors suggest using knowledge distillation to simulate the operation of recurrent-convolutional models, which have proven themselves well in the task of predicting the dynamic length of characters in images. The rapid development of deep learning systems, the recognition quality of which has reached the level of human vision, makes the method of protection using CAPTCHA increasingly ineffective. In addition, such protection imposes high requirements on the characteristics of the devices on which recognition is performed. The research carried out in this work allowed us to propose an effective method of training CNN on inaccurate data for automatic circumvention of text CAPTCHAS on mobile devices. Purpose: acquiring a lightweight and high-quality model for text CAPTCHA recognition that can work on mobile devices. Results: the paper describes a method for training a lightweight model on inaccurate markup obtained from another model. The influence of the size of the training sample on the quality of recognition, the speed of the model on various end devices is studied on the example of a popular social network. Practical significance: The proposed method allows you to train convolutional models to bypass the protection of websites-text CAPTCHA, which are undemanding to the characteristics of devices. The analysis of the model errors allows us to make recommendations for improving ways to counteract automatic recognition.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.