A brand-new character recognition method has been created especially for Tamil manuscripts written on palm leaves. A variety of preprocessing techniques are used to obtain reliable recognition because these manuscripts are complicated and have age-related deterioration. Using cutting-edge image processing techniques, manuscript photos are enhanced and preprocessed in the earliest stage to improve clarity. Then, in accordance with the distinctive arrangement of these historical artifacts, these photos are divided into three separate portions. After segmentation, these parts are converted to grayscale to create the ideal contrast needed for character detection. Noise canceling algorithms have been incorporated to prevent any distortions or degradations that may occur as a result of the age of the manuscripts or other environmental influences. After this mitigation, the photos are binarized to emphasize the isolation of the figures from their surroundings. This makes sure that each character stands out clearly, laying the groundwork for the crucial next step: segmenting individual characters. The foundation of the training phase is this segmentation. Convolutional neural networks (CNNs), which excel at handling image-based tasks, were incorporated into the deep learning architecture used for model building. Utilizing a sizable dataset compiled from numerous articles, the model underwent intensive training. As a result, the model was able to recognise characters from these texts with an astounding accuracy rate of 89%. With the help of Flask, the complete system has been easily integrated into a user-friendly web interface. This platform's goal is quite clear: fusing the elegance and profundity of ancient scripts with the effectiveness of contemporary technology. It is positioned as a crucial tool for both academic research and teaching purposes. By doing this, it guarantees that the priceless history and knowledge found in these manuscripts remain accessible and lays the groundwork for their preservation for future generations. Keywords— Character recognition, Tamil palm manuscripts, Preprocessing, Visual clarity, Grayscale conversion, Noise cancellation, Binarization, Segmentation, Training, Flask-based web interface, Research, Education.
Read full abstract