Thinning of Balti Script : Way Forward to Balti OCR

Dil Nawaz Hakro Dil Nawaz Hakro,Abdul Majid Abdul Majid,Muhammad Nadeem Muhammad Nadeem,Mashooq Ali Mahar Mashooq Ali Mahar,Qinbo Qinbo,Dilawar Khan Dilawar Khan,Saba Brahmani Saba Brahmani

doi:10.32628/cseit2410428

Abstract

Natural language is one of the applications of Artificial Intelligence, which trains machines to do the jobs in human language. OCR is one of the fields where the writing efforts are omitted and text images are converted into editable text. An OCR may have post and preprocessing to enhance the text image more suitable for the rest of the OCR process. Thinning is the preprocessing approach in which the characters, words and text is thinned to its one-pixel skeleton. Much of the work has been done in the various languages of the world as well as Pakistani languages. The work on Balti OCR is nonexistent. In this study, a thinning algorithm is proposed for the Balti language, a language spoken in the northern areas of Pakistan and India. Many of the Balti images were tested with the proposed algorithm and the proposed system produced accurate results by giving a one pixel skeleton of input image. The proposed algorithm tested with hundreds of Balti language images and selected results are presented in this paper. The current research has many directions including the way forward to building Balti OCR, Balti ICR (both segmentation based and segmentation free).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Thinning of Balti Script : Way Forward to Balti OCR

Abstract

Talk to us

Similar Papers

More From: International Journal of Scientific Research in Computer Science, Engineering and Information Technology

Lead the way for us

Journal: International Journal of Scientific Research in Computer Science, Engineering and Information Technology	Publication Date: Oct 25, 2024
License type: CC BY 4.0

Similar Papers

Standardization of Robot Instruction Elements Based on Conditional Random Fields and Word Embedding
...
-
, et. al. ...
25 Oct 2019
25 Oct 2019

Developments in The Field of Natural Language Processing

International Journal of Advanced Research in Computer Science | VOL. 8

30 Apr 2017
International Journal of Advanced Research in Computer Science | VOL. 8

Segmentation Method for Myanmar Character Recognition Using Block based Pixel Count and Aspect Ratio
Kyi Pyar Zaw ... Zin Mar Kyu
-
Kyi Pyar Zaw, et. al.Kyi Pyar Zaw ... Zin Mar Kyu
28 Oct 2017
28 Oct 2017

Active: a unified platform for building intelligent applications

-

01 Jan 2008
01 Jan 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Thinning of Balti Script : Way Forward to Balti OCR

Abstract

Talk to us

Similar Papers

More From: International Journal of Scientific Research in Computer Science, Engineering and Information Technology