Deep Learning-Based Analysis of Ancient Greek Literary Texts in English Version: A Statistical Model Based on Word Frequency and Noise Probability for the Classification of Texts

Zoltán Gál,Erzsébet Tóth

doi:10.36244/icj.2024.5.1

Abstract

In our paper we intend to present a methodology that we elaborated for clustering texts based on the word fre quency in the English translations of selected old Greek texts. We used the classification system of the ancient Library of Alex andria, devised by the prominent Greek scholar-poet, Callima chus in the 3rd century BC., as a basis for categorizing literary masterpieces. In our content analysis, we could determine a tri plet of a, b, c values for describing a power function that appro priately fits a curve determined by the word frequencies in the texts. In addition, we have discovered 16 special features of the different texts that correspond to various token categories inves tigated in each text, such as part of speech of the word in the con text, numerals, subordinate conjunction, symbols, etc. We have developed a cognitive model in which several hundred different subtexts were utilized for supervised learning with the aim of subtext class recognition. Concerning 200 subtexts, the triplet of a, b, c values, the classes of the subtexts, and their 16-dimen sional feature vectors were learnt for the Recurrent Neural Net work (RNN). It turned out that the Long-Short Term Memory RNN could efficiently predict which class a chosen subtext could be categorized into without considering the interpretation of the content. The influence of the non-zero error rate of new com munication services on the meaning of the transferred texts was also investigated. The impact of the noise on the classification accuracy was found to be linear, dependent on the character error rate.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Learning-Based Analysis of Ancient Greek Literary Texts in English Version: A Statistical Model Based on Word Frequency and Noise Probability for the Classification of Texts

Abstract

Talk to us

Similar Papers

More From: Infocommunications journal

Lead the way for us

Similar Papers

Multimodal learning using 3D audio-visual data for audio-visual speech recognition
Rongfeng Su ... Xunying Liu
-
Rongfeng Su, et. al.Rongfeng Su ... Xunying Liu
01 Dec 2017
01 Dec 2017

Forecasting Banana Harvest Yields using Deep Learning
Mariannie Rebortera ... Arnel Fajardo
-
Mariannie Rebortera, et. al.Mariannie Rebortera ... Arnel Fajardo
01 Oct 2019
01 Oct 2019

A model for estimating the occurrence of same‐frequency words and the boundary between high‐ and low‐frequency words in texts
Qinglan Sun ... Charles H Davis
Journal of the American Society for Information Science | VOL. 50
Qinglan Sun, et. al.Qinglan Sun ... Charles H Davis
01 Jan 1998
Journal of the American Society for Information Science | VOL. 50

Frequency in Incidental Vocabulary Acquisition Research: An Undefined Concept and Some Consequences
Barry Lee Reynolds ... David Wible
TESOL Quarterly | VOL. 48
Barry Lee Reynolds, et. al.Barry Lee Reynolds ... David Wible
28 Oct 2014
TESOL Quarterly | VOL. 48

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Learning-Based Analysis of Ancient Greek Literary Texts in English Version: A Statistical Model Based on Word Frequency and Noise Probability for the Classification of Texts

Abstract

Talk to us

Similar Papers

More From: Infocommunications journal