Abstract
The present study examines syntactic aspects of text written in the Gurmukhi script using several standard statistical measures. The analysis covers text from seven distinct genres, amounting to more than 6 million words and more than 440 thousand sentences. The textual data are assessed at two syntactic levels: words and sentences. The statistical analysis covers parameters such as word length, character frequency, vowel usage, word frequency, word length frequency, type-token ratio (TTR), character usage per sentence, word usage per sentence, word and character usage per sentence after stop-word removal, and correlation. The study brings out less apparent statistical properties of the Gurmukhi script and lays the groundwork for future research in quantitative linguistics.
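As a rough illustration of three of the word-level measures named above (type-token ratio, word length frequency, and stop-word removal), the following minimal Python sketch may help. It is not the authors' pipeline. The function names, the punctuation set, whitespace tokenization, and the stop-word list passed by the caller are all assumptions made for this example, and word length is counted here in Unicode code points, which may differ from the character definition used in the study.

```python
from collections import Counter

# Assumption: the danda marks and common ASCII punctuation separate tokens.
PUNCTUATION = "।॥,.;:!?\"'()-"
_PUNCT_TABLE = str.maketrans({ch: " " for ch in PUNCTUATION})


def tokenize(text):
    """Split text into word tokens on whitespace after blanking punctuation."""
    return text.translate(_PUNCT_TABLE).split()


def type_token_ratio(tokens):
    """Distinct word forms (types) divided by total tokens; 0.0 if empty."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def word_length_frequency(tokens):
    """Count how many tokens have each length, measured in code points."""
    return Counter(len(token) for token in tokens)


def remove_stop_words(tokens, stop_words):
    """Drop tokens found in a caller-supplied (hypothetical) stop-word set."""
    return [token for token in tokens if token not in stop_words]


if __name__ == "__main__":
    sample = "ਮੈਂ ਘਰ ਜਾਂਦਾ ਹਾਂ। ਮੈਂ ਘਰ ਆਇਆ।"
    tokens = tokenize(sample)
    print(len(tokens), type_token_ratio(tokens))
    print(word_length_frequency(tokens))
```

Counting length in code points treats dependent vowel signs and nasalization marks as separate characters. A study that counts user-perceived characters would need grapheme cluster segmentation instead.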