Synthetic document generator for annotation-free layout recognition

Natraj Raman,Sameena Shah,Manuela Veloso

doi:10.1016/j.patcog.2022.108660

Abstract

Analyzing the layout of a document to identify headers, sections, tables, figures etc. is critical to understanding its content. Deep learning based approaches for detecting the layout structure of document images have been promising. However, these methods require a large number of annotated examples during training, which are both expensive and time consuming to obtain. We describe here a synthetic document generator that automatically produces realistic documents with labels for spatial positions, extents and categories of the layout elements. The proposed generative process treats every physical component of a document as a random variable and models their intrinsic dependencies using a Bayesian Network graph. Our hierarchical formulation using stochastic templates allow parameter sharing between documents for retaining broad themes and yet the distributional characteristics produces visually unique samples, thereby capturing complex and diverse layouts. We empirically illustrate that a deep layout detection model trained purely on the synthetic documents can match the performance of a model that uses real documents.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Synthetic document generator for annotation-free layout recognition

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Mar 24, 2022
Citations: 2

Similar Papers

An exploratory study of one-shot learning using Siamese convolutional neural network for histopathology image classification in breast cancer from few data examples
Fabian Cano ... Angel Cruz-Roa
-
Fabian Cano, et. al.Fabian Cano ... Angel Cruz-Roa
03 Jan 2020
03 Jan 2020

Semi-automatic acquisition translation knowledge from parallel corpora
Fuji Ren ... S Kuroiwa
-
Fuji Ren, et. al. Fuji Ren ... S Kuroiwa
06 Oct 2002
06 Oct 2002

Human Posture Detection Using Image Augmentation and Hyperparameter-Optimized Transfer Learning Algorithms
Roseline Oluwaseun Ogundokun ... Robertas Damaševičius
Applied Sciences | VOL. 12
Roseline Oluwaseun Ogundokun, et. al.Roseline Oluwaseun Ogundokun ... Robertas Damaševičius
10 Oct 2022
Applied Sciences | VOL. 12

Natural language processing to facilitate breast cancer research and management.
Kevin S Hughes ... Jingan Zhou
The Breast Journal | VOL. 26
Kevin S Hughes, et. al.Kevin S Hughes ... Jingan Zhou
18 Dec 2019
The Breast Journal | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Synthetic document generator for annotation-free layout recognition

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition