Arrow R-CNN for handwritten diagram recognition

Bernhard Schäfer,Margret Keuper,Heiner Stuckenschmidt

doi:10.1007/s10032-020-00361-1

Abstract

We address the problem of offline handwritten diagram recognition. Recently, it has been shown that diagram symbols can be directly recognized with deep learning object detectors. However, object detectors are not able to recognize the diagram structure. We propose Arrow R-CNN, the first deep learning system for joint symbol and structure recognition in handwritten diagrams. Arrow R-CNN extends the Faster R-CNN object detector with an arrow head and tail keypoint predictor and a diagram-aware postprocessing method. We propose a network architecture and data augmentation methods targeted at small diagram datasets. Our diagram-aware postprocessing method addresses the insufficiencies of standard Faster R-CNN postprocessing. It reconstructs a diagram from a set of symbol detections and arrow keypoints. Arrow R-CNN improves state-of-the-art substantially: on a scanned flowchart dataset, we increase the rate of recognized diagrams from 37.7 to 78.6%.

Highlights

Graphical modeling languages are a long-used and intuitive device to visualize algorithms, business process models, and software systems
The model mostly struggles with recognizing arrows and text phrases due to their varying form and size. We agree with their motivation and propose an offline handwritten diagram recognition approach which builds upon Faster R-convolutional neural networks (CNNs) for symbol recognition
– We demonstrate how a Faster R-CNN object detector can be extended with a lightweight arrow keypoint predictor for diagram structure recognition

Summary

Introduction

Graphical modeling languages are a long-used and intuitive device to visualize algorithms, business process models, and software systems. The model mostly struggles with recognizing arrows and text phrases due to their varying form and size We agree with their motivation and propose an offline handwritten diagram recognition approach which builds upon Faster R-CNN for symbol recognition. While the recognition of computer-generated arrows in mentioned examples is important, this work focuses on handwritten diagrams, where each arrow connects two nodes, and each text phrase annotates either a node or an arrow. This structure is simple, it is sufficiently powerful to describe graphical modeling languages from various domains.

Related work

Handwritten diagram recognition

Keypoint detection

Arrow R-CNN

Network architecture

Training

Inference

Integrating diagram domain knowledge

Augmentation

Diagram-aware postprocessing

Experiments

Datasets

Evaluation metrics

Implementation

Results

Error analysis and future work

Conclusion

Compliance with ethical standards

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal on Document Analysis and Recognition (IJDAR)	Publication Date: Feb 2, 2021
Citations: 16	License type: open-access

R Discovery Prime

R Discovery Prime

Arrow R-CNN for handwritten diagram recognition

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal on Document Analysis and Recognition (IJDAR)

Lead the way for us

Similar Papers

LWIR sensor parameters for deep learning object detectors
Robert Grimming ... Abhijit Mahalanobis
OSA Continuum | VOL. 4
Robert Grimming, et. al.Robert Grimming ... Abhijit Mahalanobis
01 Feb 2021
OSA Continuum | VOL. 4

Phytoplankton detection and recognition in freshwater digital microscopy images using deep learning object detectors
Jorge Figueroa ... Jorge Novo
Heliyon | VOL. 10
Jorge Figueroa, et. al.Jorge Figueroa ... Jorge Novo
30 Jan 2024
Heliyon | VOL. 10

Training Deep Learning Spacecraft Component Detection Algorithms Using Synthetic Image Data
Herbert Viggh ... Yaron Rachlin
-
Herbert Viggh, et. al.Herbert Viggh ... Yaron Rachlin
04 Mar 2023
04 Mar 2023

An object perception and positioning method via deep perception learning object detection
Limei Xiao ... Weizhe Gao
Concurrency and computation : practice & experience | VOL. 35
Limei Xiao, et. al.Limei Xiao ... Weizhe Gao
24 Jan 2021
Concurrency and computation : practice & experience | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Arrow R-CNN for handwritten diagram recognition

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal on Document Analysis and Recognition (IJDAR)