Towards Developing Uniform Lexicon Based Sorting Algorithm for Three Prominent Indo-Aryan Languages

Mir Ragib Ishraq, M Shahidur Rahman, Asif Mohammed Samir, Nitesh Khadka

doi:10.1145/3488371

Abstract

We explore three Indic (Indo-Aryan) languages, Bengali, Hindi, and Nepali, at the character level to identify their similarities and dissimilarities. Because they share a common root, Sanskrit, Indic languages have many characteristics in common. This gives computer and language scientists an opportunity to develop common Natural Language Processing (NLP) techniques and algorithms. With this in mind, we compare and analyze the three languages character by character. As an application of this hypothesis, we also develop a uniform sorting algorithm in two steps: first for Bengali and Nepali only, then extended to Hindi. Our investigation with more than 30,000 words from each language suggests that the algorithm is fully accurate with respect to the sort order set by the respective language authorities, and that it is efficient.
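The idea of a single lexicon-driven sort that serves several related scripts can be sketched as follows. This is a minimal illustration and not the authors' algorithm: the names `build_rank`, `sort_key`, and `lexicon_sort` are hypothetical, and the lexicon is limited to the first five consonants of the Bengali and Devanagari scripts. A real lexicon would cover every character, including vowel signs and conjuncts, in the order set by each language authority.

```python
from typing import Dict, List, Tuple

# Illustrative fragment only (an assumption, not the paper's lexicon):
# the first five consonants of each script, listed in parallel.
BENGALI = ["ক", "খ", "গ", "ঘ", "ঙ"]
DEVANAGARI = ["क", "ख", "ग", "घ", "ङ"]  # script used by Hindi and Nepali


def build_rank(*alphabets: List[str]) -> Dict[str, int]:
    """Give characters at the same position in each alphabet the same rank."""
    rank: Dict[str, int] = {}
    for alphabet in alphabets:
        for position, char in enumerate(alphabet):
            rank[char] = position
    return rank


RANK = build_rank(BENGALI, DEVANAGARI)
UNKNOWN = max(RANK.values()) + 1  # characters outside the lexicon sort last


def sort_key(word: str) -> Tuple[int, ...]:
    """Map a word to a tuple of lexicon ranks, one per code point."""
    return tuple(RANK.get(char, UNKNOWN) for char in word)


def lexicon_sort(words: List[str]) -> List[str]:
    """Sort words by lexicon rank instead of raw code point value."""
    return sorted(words, key=sort_key)


if __name__ == "__main__":
    print(lexicon_sort(["গক", "কখ", "খগ"]))  # toy Bengali strings
    print(lexicon_sort(["घ", "क", "ग"]))     # toy Devanagari strings
```

Keeping the order in a lookup table means the comparison logic stays the same when another language is added; only the table grows. Python's default string comparison uses code point order, which need not match a dictionary order, so the explicit rank table is where language-specific rules would go.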

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing
Publication Date: Dec 13, 2021
Citations: 1
