Approximate Entropy in Canonical and Non-Canonical Fiction.

Mahdi Mohseni, Christoph Redies, Volker Gast

doi:10.3390/e24020278

Abstract

Computational textual aesthetics aims at studying observable differences between aesthetic categories of text. We use Approximate Entropy to measure the (un)predictability in two aesthetic text categories, i.e., canonical fiction (‘classics’) and non-canonical fiction (with lower prestige). Approximate Entropy is determined for series derived from sentence-length values and the distribution of part-of-speech-tags in windows of texts. For comparison, we also include a sample of non-fictional texts. Moreover, we use Shannon Entropy to estimate degrees of (un)predictability due to frequency distributions in the entire text. Our results show that the Approximate Entropy values can better differentiate canonical from non-canonical texts compared with Shannon Entropy, which is not true for the classification of fictional vs. expository prose. Canonical and non-canonical texts thus differ in sequential structure, while inter-genre differences are a matter of the overall distribution of local frequencies. We conclude that canonical fictional texts exhibit a higher degree of (sequential) unpredictability compared with non-canonical texts, corresponding to the popular assumption that they are more ‘demanding’ and ‘richer’. In using Approximate Entropy, we propose a new method for text classification in the context of computational textual aesthetics.
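As an illustration of the measure named in the abstract, the following is a minimal sketch of Approximate Entropy in its standard textbook form (template length m, tolerance r, Chebyshev distance, self-matches counted). The function name `approximate_entropy`, the default `m=2`, and the default tolerance of 0.2 times the standard deviation are assumptions chosen for illustration; the abstract does not state the parameters the authors used.

```python
import numpy as np


def approximate_entropy(series, m=2, r=None):
    """Approximate Entropy of a 1-D series (illustrative sketch).

    m: template length (assumed default, not taken from the paper)
    r: tolerance; defaults to 0.2 * standard deviation (a common convention)
    """
    u = np.asarray(series, dtype=float)
    n = u.size
    if n <= m + 1:
        raise ValueError("series is too short for the chosen m")
    if r is None:
        r = 0.2 * u.std()

    def phi(dim):
        # All overlapping templates of length `dim`.
        templates = np.array([u[i:i + dim] for i in range(n - dim + 1)])
        # Chebyshev (max-coordinate) distance between every pair of templates.
        diff = np.abs(templates[:, None, :] - templates[None, :, :])
        dist = diff.max(axis=2)
        # Fraction of templates within r of each template; the self-match
        # is included, so every fraction is positive and the log is defined.
        fractions = np.mean(dist <= r, axis=1)
        return np.mean(np.log(fractions))

    return phi(m) - phi(m + 1)


if __name__ == "__main__":
    # A strictly alternating series is highly predictable; uniform noise is not.
    regular = [1, 2] * 50
    noisy = np.random.default_rng(0).random(200)
    print("regular:", approximate_entropy(regular))
    print("noisy:  ", approximate_entropy(noisy))
```

The pairwise distance matrix makes this quadratic in the series length, which is acceptable for series of a few thousand values but would need a different approach for much longer ones.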

Highlights

Computational textual aesthetics is an emerging field at the interface of literary studies and linguistics.
As we wish to determine to what extent any observed differences are genre-related, we included non-fictional texts in our comparison.
The most important observation that stands out from a superficial inspection of Tables 2 and 3 is that the left two columns, which show the values for canonical and non-canonical fiction, exhibit a rather uniform pattern: while there are no significant differences between the values for sentence length, the Approximate Entropy (ApEn) as well as the Shannon Entropy (ShEn) values for all series derived from POS-frequencies within boxes are higher for canonical than for non-canonical texts.

Summary

Introduction

Computational textual aesthetics is an emerging field at the interface of literary studies and linguistics. Mohseni et al. [9] used a number of textual properties (sentence length, frequencies of specific POS-tags per sentence, lexical diversity measured with MTLD, and topic probabilities) to generate series. They analysed these series in terms of variance and long-range correlations. Of particular interest in this context are features that are amenable to experimental studies, if they allow for an interpretation in terms of perception and processing, as has been hypothesized for fractality and long-range correlations [9]. Another important aspect of aesthetic perception is the degree of (ir)regularity in a text and, related to this, the degree of predictability or surprise in the signal (cf. Zipf’s principles of ‘unification’ and ‘diversification’).
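To make the notion of a series derived from a textual property concrete, here is a minimal sketch that turns a text into a sentence-length series and computes the Shannon Entropy of its value distribution. The helper names `sentence_lengths` and `shannon_entropy` are hypothetical, and the regex-based sentence splitter is a deliberately naive assumption; the source does not describe the authors' preprocessing.

```python
import math
import re
from collections import Counter


def sentence_lengths(text):
    """Return the number of whitespace-separated tokens per sentence.

    Sentences are split after '.', '!' or '?' followed by whitespace,
    which is a naive assumption and will mis-split abbreviations.
    """
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    return [len(s.split()) for s in sentences]


def shannon_entropy(values):
    """Shannon Entropy (in bits) of the empirical distribution of `values`."""
    counts = Counter(values)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


if __name__ == "__main__":
    sample = "It rained. We stayed in all day! Why not? Nobody asked us to leave."
    lengths = sentence_lengths(sample)
    print(lengths, shannon_entropy(lengths))
```

Because this quantity depends only on how often each value occurs, reordering the series leaves it unchanged. That is the sense in which Shannon Entropy captures frequency distributions, while an order-sensitive measure such as Approximate Entropy captures sequential structure.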

Data and Methods

Properties Underlying Textual Structure

Computation of Unpredictability in Text

Shannon Entropy

Approximate Entropy

Results

Statistical Analysis of Features

Classification

Most Discriminative Features

Discussion and Conclusions

Journal: Entropy
Publication Date: Feb 15, 2022
Citations: 5
License type: CC BY 4.0

Similar Papers

Fractal dimension and approximate entropy of heart period and heart rate: awake versus sleep differences and methodological issues
Vikram K. YERAGANI ... E. SOBOLEWSKI
Clinical Science | VOL. 95
01 Sep 1998

Gait Variability Patterns are Altered in Healthy Young Individuals During the Acute Reperfusion Phase of Ischemia-Reperfusion
Sara A Myers ... Jason M Johanning
Journal of Surgical Research | VOL. 164
18 May 2010

Application of complexity and approximate entropy on fault diagnoses
Bingcheng Wang ... Zhaohui Ren
01 Aug 2010

Anomaly detection for equipment condition via cross-correlation approximate entropy
Tianyang Wang ... Jianyong Li
01 Jan 2010
