FEGAN: A High-Performance Font Enhancement Network for Text CAPTCHA Preprocessing

  • Abstract
  • References
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

This study aims to address performance deficiencies in CAPTCHA preprocessing methods that impede the accurate recognition of text CAPTCHAs, which are crucial for identifying security vulnerabilities. To improve CAPTCHA preprocessing methods, a similar font is initially searched and acquired by manually removing obstructing pixels from a target CAPTCHA and retaining the font part. Using the found font, a pseudo-dataset is generated containing a large number of clean and dirty pairs to train to the proposed supervised Font Enhancement Generative Adversarial Network (FEGAN), which is designed to effectively eliminate non-font-related interferences and preserve the font outlines. Test results show that FEGAN can improve the recognizer’s accuracy by approximately 16% to 50% on the M-CAPTCHA dataset (a publicly available dataset on Kaggle) and 5% to 35% on the P-CAPTCHA dataset (generated using the Python ImageCaptcha package), substantially outperforming the Multiview-filtering-based preprocessing approach.

ReferencesShowing 10 of 23 papers
  • Open Access Icon
  • Cite Count Icon 3868
  • 10.1007/s11263-019-01228-7
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization
  • Oct 11, 2019
  • International Journal of Computer Vision
  • Ramprasaath R Selvaraju + 5 more

  • Open Access Icon
  • Cite Count Icon 17
  • 10.1145/3378446
Using Generative Adversarial Networks to Break and Protect Text Captchas
  • Apr 17, 2020
  • ACM Transactions on Privacy and Security
  • Guixin Ye + 8 more

  • Open Access Icon
  • PDF Download Icon
  • Cite Count Icon 5
  • 10.3390/app14125016
Adaptive CAPTCHA: A CRNN-Based Text CAPTCHA Solver with Adaptive Fusion Filter Networks
  • Jun 8, 2024
  • Applied Sciences
  • Xing Wan + 2 more

  • Open Access Icon
  • Cite Count Icon 21
  • 10.1145/3505226
Counteracting Dark Web Text-Based CAPTCHA with Generative Adversarial Learning for Proactive Cyber Threat Intelligence
  • Mar 10, 2022
  • ACM Transactions on Management Information Systems
  • Ning Zhang + 3 more

  • Cite Count Icon 2
  • 10.1109/inocon60754.2024.10511373
CAPTCHA in Web Security and Deep-Captcha Configuration based on Machine learning
  • Mar 1, 2024
  • Sahil Sharma + 1 more

  • Cite Count Icon 3
  • 10.1109/cedl60560.2023.00040
A Review on Text-based CAPTCHA Breaking Based on Deep Learning Methods
  • Jun 29, 2023
  • Wan Xing + 3 more

  • Open Access Icon
  • Cite Count Icon 4548
  • 10.1109/iccv.2017.304
Least Squares Generative Adversarial Networks
  • Oct 1, 2017
  • Xudong Mao + 5 more

  • Open Access Icon
  • Cite Count Icon 27
  • 10.1016/j.neucom.2020.11.057
End-to-end attack on text-based CAPTCHAs based on cycle-consistent generative adversarial network
  • Dec 8, 2020
  • Neurocomputing
  • Chunhui Li + 5 more

  • Open Access Icon
  • PDF Download Icon
  • Cite Count Icon 9
  • 10.1016/j.neunet.2022.06.041
Breaking CAPTCHA with Capsule Networks
  • Jul 8, 2022
  • Neural Networks
  • Ionela Georgiana Mocanu + 2 more

  • Open Access Icon
  • Cite Count Icon 16
  • 10.31219/osf.io/km35b
Deep-CAPTCHA: A Deep Learning Based CAPTCHA Solver for Vulnerability Assessment
  • Jul 12, 2020
  • Zahra Nouri + 1 more

Similar Papers
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 55
  • 10.1109/access.2020.3030058
Imbalanced Fault Diagnosis of Rolling Bearing Using Enhanced Generative Adversarial Networks
  • Jan 1, 2020
  • IEEE Access
  • Hongliang Zhang + 3 more

Machinery fault diagnosis tasks have been well addressed when sufficient and abundant data are available. However, the data imbalance problem widely exists in real-world scenarios, which leads to the performance deterioration of fault diagnosis markedly. To solve this problem, we present a novel imbalanced fault diagnosis method based on the enhanced generative adversarial networks (GAN). By artificially generating fake samples, the proposed method can mitigate the loss caused by the lack of real fault data. Specifically, in order to improve the quality of generated samples, a new discriminator is designed using spectrum normalization (SN) strategy and a two time-scale update rule (TTUR) method is used to stabilize the training process of GAN. Then, an enhanced Wasserstein GAN with gradient penalty is developed to generate high-quality synthetic samples for the fault samples set. Finally, a deep convolutional classifier is constructed to carry out fault classification. The performance and effectiveness of the proposed method are validated on the Case Western Reserve University bearing dataset and rolling bearing dataset acquired from our laboratory. The simulation results show that the proposed method has a superior performance than other methods for imbalanced fault diagnosis tasks.

  • Research Article
  • Cite Count Icon 11
  • 10.1080/10255842.2022.2134729
Imbalanced medical disease dataset classification using enhanced generative adversarial network
  • Oct 13, 2022
  • Computer Methods in Biomechanics and Biomedical Engineering
  • T Suresh + 2 more

In general, the imbalanced dataset is a major issue in health applications. The medical data classification faces the imbalanced count of data samples, here at least one class forms only a very small minority of the data, but it is a drawback of most of the machine learning algorithms. The medical datasets are mostly imbalanced in its class labels. When the dataset is imbalanced, the existing classification algorithms typically perform badly on minority class cases. To deal the class imbalance issue, an enhanced generative adversarial network (E-GAN) is proposed in this article. The proposed approach is the consolidation of deep convolutional generative adversarial network and modified convolutional neural network (DCG-MCNN). Initially, the imbalanced data is converted into balanced data in pre-processing process. Data preprocessing comprise of data cleaning, data normalization, data transformation and data reduction using Radius Synthetic minority oversampling technique (RSMOTE) method. The DCG is considered for balancing the dataset generating extra samples under training dataset. This training dataset based, the medical disease classification is enhanced by modified CNN diagnosis model. The proposed system performed is executed in MATLAB. The performance analysis is implemented under the Breast Cancer Wisconsin Dataset that provides the higher maximum geometry mean (MGM) of 8.686, 2.931 and 5.413%, and higher Matthews’s correlation coefficient (MCC) of 9.776, 1.841 and 5.413% compared to the existing methods.

  • Research Article
  • 10.1049/itr2.70005
Recovering Missing Passenger Flow Data in Subway Stations via an Enhanced Generative Adversarial Network
  • Jan 1, 2025
  • IET Intelligent Transport Systems
  • Hongru Yu + 5 more

ABSTRACTTo address the challenges posed by incomplete data in passenger flow prediction and organizational tasks, this paper proposes ProbSparse self‐attention conditional generative adversarial imputation net (ProbSA‐CGAIN), a novel imputation model framework built on the enhanced generative adversarial network (GAN). The model leverages conditional GANs for controlled data generation using external conditional information. It adopts a denoising autoencoder structure for reconstructing and estimating missing passenger flow data. The integration of an efficient ProbSparse self‐attention mechanism captures spatiotemporal evolution features, reducing computational complexity. Additionally, the model incorporates auxiliary conditional information to enhance data imputation accuracy by learning interdependencies among multiple data variables. Further, the model integrates local positional encoding and multi‐layer global temporal encoding, offering diverse perspectives on spatiotemporal information. Experimental evaluations with real passenger flow data demonstrate the model's superiority over advanced baseline models across various missing patterns and rates. Notably, it exhibits high stability in data restoration, particularly for datasets with higher missing rates, affirming its effectiveness in predicting and inferring missing passenger flow data based on auxiliary data and multi‐view positional information, ensuring reliable imputation. The experiments also assess the model's proficiency in attributing different spatiotemporal features, confirming its commendable training and restoration efficiency.

  • Conference Article
  • Cite Count Icon 114
  • 10.1145/3243734.3243754
Yet Another Text Captcha Solver
  • Oct 15, 2018
  • Guixin Ye + 7 more

Despite several attacks have been proposed, text-based CAPTCHAs are still being widely used as a security mechanism. One of the reasons for the pervasive use of text captchas is that many of the prior attacks are scheme-specific and require a labor-intensive and time-consuming process to construct. This means that a change in the captcha security features like a noisier background can simply invalid an earlier attack. This paper presents a generic, yet effective text captcha solver based on the generative adversarial network. Unlike prior machine-learning-based approaches that need a large volume of manually-labeled real captchas to learn an effective solver, our approach requires significantly fewer real captchas but yields much better performance. This is achieved by first learning a captcha synthesizer to automatically generate synthetic captchas to learn a base solver, and then fine-tuning the base solver on a small set of real captchas using transfer learning. We evaluate our approach by applying it to 33 captcha schemes, including 11 schemes that are currently being used by 32 of the top-50 popular websites including Microsoft, Wikipedia, eBay and Google. Our approach is the most capable attack on text captchas seen to date. It outperforms four state-of-the-art text-captcha solvers by not only delivering a significant higher accuracy on all testing schemes, but also successfully attacking schemes where others have zero chance. We show that our approach is highly efficient as it can solve a captcha within 0.05 second using a desktop GPU. We demonstrate that our attack is generally applicable because it can bypass the advanced security features employed by most modern text captcha schemes. We hope the results of our work can encourage the community to revisit the design and practical use of text captchas.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 8
  • 10.3390/app12031191
Enhanced Generative Adversarial Networks with Restart Learning Rate in Discriminator
  • Jan 24, 2022
  • Applied Sciences
  • Kun Li + 1 more

A series of Generative Adversarial Networks (GANs) could effectively capture the salient features in the dataset in an adversarial way, thereby generating target data. The discriminator of GANs provides significant information to update parameters in the generator and itself. However, the discriminator usually becomes converged before the generator has been well trained. Due to this problem, GANs frequently fail to converge and are led to mode collapse. This situation can cause inadequate learning. In this paper, we apply restart learning in the discriminator of the GAN model, which could bring more meaningful updates for the training process. Based on CIFAR-10 and Align Celeba, the experiment results show that the proposed method could improve the performance of a DCGAN with a low FID score over a stable learning rate scheme. Compared with two other stable GANs—SNGAN and WGAN-GP—the DCGAN with a restart schedule had a satisfying performance. Compared with the Two Time-Scale Update Rule, the restart learning rate is more conducive to the training of DCGAN. The empirical analysis indicates four main parameters have varying degrees of influence on the proposed method and present an appropriate parameter setting.

  • Research Article
  • Cite Count Icon 1
  • 10.15587/1729-4061.2020.201731
A steganographic method of improved resistance to the rich model­based analysis
  • Apr 30, 2020
  • Eastern-European Journal of Enterprise Technologies
  • Nikolay Kalashnikov + 3 more

This paper addresses the task of developing a steganographic method to hide information, resistant to analysis based on the Rich model (which includes several different submodels), using statistical indicators for the distribution of the pairs of coefficients for a discrete cosine transform (DCT) with different values. This type of analysis implies calculating the number of DCT coefficients pairs, whose coordinates in the frequency domain differ by a fixed quantity (the offset). Based on these values, a classifier is trained for a certain large enough data sample, which, based on the distribution of the DCT coefficients pairs for an individual image, determines the presence of additional information in it.A method based on the preliminary container modification before embedding a message has been proposed to mitigate the probability of hidden message detection. The so-called Generative Adversarial Network (GAN), consisting of two related neural networks, generator and discriminator, was used for the modification. The generator creates a modified image based on the original container; the discriminator verifies the degree to which the modified image is close to the preset one and provides feedback for the generator.By using a GAN, based on the original container, such a modified container is generated so that, following the embedding of a known steganographic message, the distribution of DCT coefficients pairs is maximally close to the indicators of the original container.We have simulated the operation of the proposed modification; based on the simulation results, the probabilities have been computed of the proper detection of the hidden information in the container when it was modified and when it was not. The simulation results have shown that the application of the modification based on modern information technologies (such as machine learning and neural networks) could significantly reduce the likelihood of message detection and improve the resistance against a steganographic analysis

  • Research Article
  • 10.29408/edumatic.v8i1.25463
Evaluasi Risiko Celah Keamanan Aplikasi E-Office menggunakan Metode OWASP
  • Jun 20, 2024
  • Edumatic: Jurnal Pendidikan Informatika
  • Tata Sutabri + 3 more

Based on data from Badan Siber dan Sandi Negara (BSSN) in 2022, it was reported that a total of 1,950 security vulnerabilities were found in 457 electronic systems across various applications widely used by the public. The purpose of this research is to evaluate the risk of existing security vulnerabilities in the E-Office application and determine the level and impact that these vulnerabilities can cause. This research focuses on information system security, specifically evaluating the risk of security vulnerabilities in the E-Office application of the Ogan Ilir Regency. The research was conducted using the Open Web Application Security Project (OWASP) method with a risk rating assessment. The research process began with a literature review to gather data and information sources, determine the scope and research objectives, test, identify security vulnerabilities, analyze security vulnerabilities, and the results of the analysis. The research subject is the E-Office application of Ogan Ilir Regency, with the object of the research being the security vulnerabilities in that application. OWASPZap was used as a tool to obtain data on security vulnerabilities, and using OWASPZap, 38 security vulnerabilities were found, with 18 of them meeting the criteria of the OWASP Top 10. Our findings indicate that the security vulnerabilities in the E-Office application of Ogan Ilir Regency include vulnerabilities in authentication levels, access control, configuration, and data validation processes.

  • Research Article
  • Cite Count Icon 85
  • 10.1016/j.measurement.2021.109467
Enhanced generative adversarial network for extremely imbalanced fault diagnosis of rotating machine
  • Apr 30, 2021
  • Measurement
  • Rugen Wang + 3 more

Enhanced generative adversarial network for extremely imbalanced fault diagnosis of rotating machine

  • Research Article
  • Cite Count Icon 5
  • 10.1016/j.image.2020.116072
Joint image-to-image translation with denoising using enhanced generative adversarial networks
  • Nov 16, 2020
  • Signal Processing: Image Communication
  • Lan Yan + 3 more

Joint image-to-image translation with denoising using enhanced generative adversarial networks

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 8
  • 10.3390/diagnostics13071358
Automatic Kidney Segmentation Method Based on an Enhanced Generative Adversarial Network
  • Apr 6, 2023
  • Diagnostics
  • Tian Shan + 2 more

When deciding on a kidney tumor’s diagnosis and treatment, it is critical to take its morphometry into account. It is challenging to undertake a quantitative analysis of the association between kidney tumor morphology and clinical outcomes due to a paucity of data and the need for the time-consuming manual measurement of imaging variables. To address this issue, an autonomous kidney segmentation technique, namely SegTGAN, is proposed in this paper, which is based on a conventional generative adversarial network model. Its core framework includes a discriminator network with multi-scale feature extraction and a fully convolutional generator network made up of densely linked blocks. For qualitative and quantitative comparisons with the SegTGAN technique, the widely used and related medical image segmentation networks U-Net, FCN, and SegAN are used. The experimental results show that the Dice similarity coefficient (DSC), volumetric overlap error (VOE), accuracy (ACC), and average surface distance (ASD) of SegTGAN on the Kits19 dataset reach 92.28%, 16.17%, 97.28%, and 0.61 mm, respectively. SegTGAN outscores all the other neural networks, which indicates that our proposed model has the potential to improve the accuracy of CT-based kidney segmentation.

  • Conference Article
  • Cite Count Icon 13
  • 10.1109/aike.2019.00057
Realistic Data Synthesis Using Enhanced Generative Adversarial Networks
  • Jun 1, 2019
  • Mrinal Kanti Baowaly + 2 more

Real data with privacy and confidentiality concerns are not often available or are too expensive to afford in respect of both time and money. In this situation, it is a good alternative to use synthetic data. The objective of this research is to generate realistic synthetic data so that people can use it freely. We propose a synthetic data generation model based on boundary-seeking generative adversarial networks (BGANs)–designated as medical BGAN or medBGAN and compare its performances with an existing method medical GAN (medGAN). We aim to perform the investigation on several datasets in two different domains: electronic health records (EHRs) in the medical domain and a crime dataset in the City of Los Angeles Police Department. Firstly, we train the models and generate synthetic data by using these trained models. We then analyze and compare the models' performance by applying some statistical methods (dimension-wise average and Kolmogorov-Smirnov test) and two machine learning tasks (association rule mining and prediction). The comprehensive analysis of this study shows that the proposed model is more efficient in generating realistic synthetic data than those generated using medGAN.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 8
  • 10.3390/app12126067
Single Image Super-Resolution Method Based on an Improved Adversarial Generation Network
  • Jun 15, 2022
  • Applied Sciences
  • Qiang Wang + 3 more

Super-Resolution (SR) techniques for image restoration have recently been gaining attention due to their excellent performance. For powerful learning abilities, Generative Adversarial Networks (GANs) have been proven to have achieved great success. In this paper, we propose an Enhanced Generative Adversarial Network (EGAN) for improving its effects for a real-time Super-Resolution task. The main content of this paper are as follows: (1) We adopted the Laplacian pyramid framework as a pre-trained module, which is beneficial for providing multiscale features for our input. (2) At each feature block, a convolutional skip-connections network, which may contain some latent information, was significant for the generative model to reconstruct a plausible-looking image. (3) Considering that the edge details usually play an important role in image generation, a perceptual loss function was defined to train and seek the optimal parameters. Quantitative and qualitative evaluations were demonstrated so that our algorithm not only took full advantage of the Convolutional Neural Networks (CNNs) to improve the image quality, but also performed better than other algorithms in speed and performance for real-time Super-Resolution tasks.

  • Research Article
  • Cite Count Icon 19
  • 10.1016/j.compbiomed.2022.105985
Synthetic augmentation for semantic segmentation of class imbalanced biomedical images: A data pair generative adversarial network approach
  • Sep 6, 2022
  • Computers in biology and medicine
  • Lu Chai + 6 more

Synthetic augmentation for semantic segmentation of class imbalanced biomedical images: A data pair generative adversarial network approach

  • Research Article
  • Cite Count Icon 1
  • 10.1007/s11042-024-20186-y
Parent GAN: image generation model for creating parent’s images using children’s images
  • Oct 1, 2024
  • Multimedia Tools and Applications
  • M Moheeb Emara + 2 more

In the past decade, several applications have emerged in predicting children’s images using their parents via Generative Adversarial Networks (GANs). However, no one has tackled the problem of predicting one of the parents using the other parent and their children or answering the question of the possibility of deducing the parent images from the children and other parent image features. It could be used in parental identification cases. Moreover, it could help children who don’t know one of their parents to have a visual representation of their images. To perform this task, several obstacles were overcome, like the small number of parent pairs in the dataset and stabilizing the GANs to produce good-looking images. The proposed method depends on dual GAN architecture in addition to adaptive instance normalization layers and introducing a triple loss function to stabilize further and improve the resulting images. The results were proven using a kinship verification model, a face verification model, and other well-known evaluation metrics, which showed that the generated parent images are of decent quality compared to real parents’ images with affordable computational hardware. As a result, a novel method is developed that could produce unknown parent images.

  • Research Article
  • Cite Count Icon 2
  • 10.1016/j.inffus.2024.102632
Exploring adversarial deep learning for fusion in multi-color channel skin detection applications
  • Aug 14, 2024
  • Information Fusion
  • Mohammed Chyad + 8 more

Exploring adversarial deep learning for fusion in multi-color channel skin detection applications

More from: International Journal of Engineering and Technology Innovation
  • Research Article
  • 10.46604/ijeti.2024.14795
Evaluation of Passenger Thermal Comfort for Two Different Underground Metro Station Typologies in Istanbul
  • Apr 30, 2025
  • International Journal of Engineering and Technology Innovation
  • Ismail Ahmet Odabaş + 1 more

  • Research Article
  • 10.46604/ijeti.2024.14304
FCA-ResNet: An Improved Model with Enhanced Multi-Scale Feature Fusion and Coordinate Attention for Wheat Leaf Disease Classification
  • Apr 18, 2025
  • International Journal of Engineering and Technology Innovation
  • Hongyan Zang + 5 more

  • Research Article
  • 10.46604/ijeti.2024.13977
FEGAN: A High-Performance Font Enhancement Network for Text CAPTCHA Preprocessing
  • Mar 20, 2025
  • International Journal of Engineering and Technology Innovation
  • Xing Wan + 2 more

  • Research Article
  • 10.46604/ijeti.2024.13853
Feasibility of 3D Printing on Environmentally Friendly Cementless Materials with Low Thermal Conductivity
  • Mar 20, 2025
  • International Journal of Engineering and Technology Innovation
  • Wei-Ting Lin + 4 more

  • Research Article
  • 10.46604/ijeti.2024.14104
Using Webpage Comparison Method for Automated Web Application Testing with Reinforcement Learning
  • Mar 12, 2025
  • International Journal of Engineering and Technology Innovation
  • Ci-Feng Lai + 2 more

  • Research Article
  • 10.46604/ijeti.2024.14100
A Novel Traveling Wave Fault Location Method Based on ICEEMDAN-NTEO for Distribution Network
  • Mar 12, 2025
  • International Journal of Engineering and Technology Innovation
  • Hongwen Liu + 5 more

  • Research Article
  • 10.46604/ijeti.2024.14017
Design of AlN-Based PMUT with High Electromechanical Coupling Efficiency for Breast Cancer Diagnosis
  • Feb 14, 2025
  • International Journal of Engineering and Technology Innovation
  • Chang Liu + 2 more

  • Open Access Icon
  • Research Article
  • 10.46604/ijeti.2024.13781
Lighting Design for Visual Comfort and Energy Efficiency Considerations: A Patient Room Case Study
  • Feb 14, 2025
  • International Journal of Engineering and Technology Innovation
  • Samaneh Aghajari + 1 more

  • Research Article
  • 10.46604/ijeti.2024.13827
Dynamic Allocation Strategy for the Car Rental Industry with Multiple Rental Channels
  • Dec 20, 2024
  • International Journal of Engineering and Technology Innovation
  • Peng-Sheng You + 1 more

  • Open Access Icon
  • Research Article
  • 10.46604/ijeti.2024.13748
Pest-YOLO: A YOLOv5-Based Lightweight Crop Pest Detection Algorithm
  • Dec 18, 2024
  • International Journal of Engineering and Technology Innovation
  • Wanbo Luo

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon

AI summaries and top papers from 250M+ research sources.

Search IconWhat is the difference between bacteria and viruses?
Open In New Tab Icon
Search IconWhat is the function of the immune system?
Open In New Tab Icon
Search IconCan diabetes be passed down from one generation to the next?
Open In New Tab Icon