Quantitative analysis of mathematical documents

S Uchida,M Suzuki,A Nomura

doi:10.1007/s10032-005-0142-y

Quantitative analysis of mathematical documents

S Uchida, M Suzuki + Show 1 more

https://doi.org/10.1007/s10032-005-0142-y

Copy DOI

Journal: International Journal on Document Analysis and Recognition (IJDAR)	Publication Date: Sep 1, 2005
Citations: 40

Affiliation: Kyushu University

#Abnormal Characters #Character Categories + Show 5 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Mathematical documents are analyzed from several viewpoints for the development of practical OCR for mathematical and other scientific documents. Specifically, four viewpoints are quantified using a large-scale database of mathematical documents, containing 690,000 manually ground-truthed characters: (i) the number of character categories, (ii) abnormal characters (e.g., touching characters), (iii) character size variation, and (iv) the complexity of the mathematical expressions. The result of these analyses clarifies the difficulties of recognizing mathematical documents and then suggests several promising directions to overcome them.

Full Text