Gaussian Mixture Modeling of Neighbor Characters for Multilingual Text Extraction in Images

Hui Fu,Hongbin Deng,Xiabi Liu,Yingmin Jia

doi:10.1109/icip.2006.312883

Gaussian Mixture Modeling of Neighbor Characters for Multilingual Text Extraction in Images

Hui Fu, Hongbin Deng + Show 2 more

https://doi.org/10.1109/icip.2006.312883

Copy DOI

Publication Date: Oct 1, 2006

Citations: 6

Affiliation: Beijing Institute of Technology

#Text Extraction In Images #Text Extraction + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper proposes a new method to extract multilingual text in images through discriminating characters from non-characters based on the Gaussian mixture modeling of neighbor characters. The image is binarized and the morphological closing operation is performed on the binary image, in order that each character in it can be treated as a connected component; the neighborhood of connected components are computed based on the Voronoi partition of the image, and each connected component is labeled as character or non-character according to its neighbors. We applied the proposed text extraction method to Chinese and English text extraction, the effectiveness of which is confirmed by the experimental results.

Full Text