Abstract

This article extends recent work on the application of computational linguistics to the analysis of poetry. The dataset consisted of 85 canonical English poems and a matched control group of obscure poems. I used Linguistic Inquiry and Word Count to create more than 65 linguistic variables and then used machine learning to develop a classifier designed to distinguish between the canonical (highly anthologized) poems and the obscure (seldom anthologized) poems. The classifier consists of 6 variables and has an accuracy of 69% in distinguishing between canonical and obscure poems. I then ranked the poems using the probability scores of the classifier and found that Blake's A Poison Tree scored highest. I explain the ranking method as being a means of distilling the “literary” appeal from the “popular” appeal of the poems in the sample. Finally, I discuss the implications for the theory of poetry in general.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.