Abstract


 
 
 Segmentation is an important step for developing any optical character recognition (OCR) system, which has to be redesigned for each script having, non-uniform nature/property. It is used to decompose the image into its sub-units, which act as a basis for character recognition. Brahmi is a non-cursive ancient script, in which characters are not attached to each other and have some spacing between them. This study analyses various segmentation methods for different scripts to develop the best suitable segmentation method for Brahmi. MATLAB software was used for segmentation purpose in the experiment. The sample data belongs to Brahmi script-based ‘Rumandei inscription’. In this paper, we discuss a segmentation methodology for distinct components, namely text lines, words and characters of Rumandei inscription, written in Brahmi script. For segmenting distinct components of inscription different approach were used like horizontal projection profile, vertical projection profile and Relative minima approach. This is fundamental research on an inscription based on Brahmi script, which acts as a foundation for developing a segmentation module of an OCR solution/system of similar scripts in future. Information search and retrieval is an important activity of a library. So, to ensure this support for digitised documents written in ancient script, their character recognition is mandatory through the OCR system.
 
 

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call