Abstract

CAPTCHA(Completely Automated Public Turing test to Tell Computers and Humans Apart) can be used to protect data from auto bots. Countless kinds of CAPTCHAs are thus designed, while we most frequently utilize text-based scheme because of most convenience and user-friendly way \cite{bursztein2011text}. Currently, various types of CAPTCHAs need corresponding segmentation to identify single character due to the numerous different segmentation ways. Our goal is to defeat the CAPTCHA, thus firstly the CAPTCHAs need to be split into character by character. There isn't a regular segmentation algorithm to obtain the divided characters in all kinds of examples, which means that we have to treat the segmentation individually. In this paper, we build a whole system to defeat the CAPTCHAs as well as achieve state-of-the-art performance. In detail, we present our self-adaptive algorithm to segment different kinds of characters optimally, and then utilize both the existing methods and our own constructed convolutional neural network as an extra classifier. Results are provided showing how our system work well towards defeating these CAPTCHAs.

Highlights

  • CAPTCHA is used to tell human beings and computer programs apart automatically

  • CAPTCHA designers change the combination of coloring numbers and characters and so on which can be recognized by people but not the auto-mated bots [2] [14], besides both companies and individuals would like to apply text-based CAPTCHAs most frequently because of the convenience.To defeat text-based CAPTCHA, three steps are normally needed: preprocessing by denoising, segmentation to get individual characters and recognition to identify each character

  • We present our novel adaptive algorithm to optimize the segmentation in defeating the CAPTCHAs which will be further discussed

Read more

Summary

INTRODUCTION

CAPTCHA designers change the combination of coloring numbers and characters and so on which can be recognized by people but not the auto-mated bots [2] [14], besides both companies and individuals would like to apply text-based CAPTCHAs most frequently because of the convenience.To defeat text-based CAPTCHA, three steps are normally needed: preprocessing by denoising, segmentation to get individual characters and recognition to identify each character. Those three steps are treated as important.

BACKGROUND
Recognition
Denoise Filter
Adaptive algorithm
Convolutional Neural Network Construction
Findings
CONCLUSIONS

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.