Abstract

Recognition of amount in figures in the financial multi-bill scenes is crucial for the automatic banking business. However, the diversity of banking business and the limitation of customer data privacy determine that it is difficult to collect a large number of sample datasets. Aiming at the problem of insufficient training data in multi-bill scenes and the low accuracy of the detection model, this article proposes a new generative adversarial network (GAN) to generate new samples and to expand the bill dataset, which is then adopted to train a framework for recognition of the bill amount. In the proposed WGAN-SA, a residual block is adopted as the basic structure of the generator and the discriminator, and the self-attention mechanism is also utilized to improve the generation performance. In addition, Wasserstein distance is utilized to measure the distance between real and synthetic samples. Experimental results on the benchmark dataset and comparisons with state-of-the-art works show that our proposed WGAN-SA can effectively improve the few-sample learning performance. Besides, experiments on the bill dataset verify that our method can solve the problem of model collapse and has the ability to generate images of the amount in figures with better fidelity and variety, which is also helpful to achieve better bill amount recognition performance compared with other latest works.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call