Abstract
Stylish differences among poets are usually sought in sound and semantics. In human analysis, the criteria for recognizing stylistic differences are manifold and intermingled. This study demonstrates that successful identification of poets based on their work is possible using one criterion: letter sequences. Poets show preferences for certain letter combinations, which are unique to their writing style. Using this criterion in machine computation demonstrates that semantics are not needed to identify poets correctly, and that, as a concession to utter parsimony, one minimal criterion of unique letter sequences is enough to fingerprint an author. A small sample of the work of three Dutch poets was used: Bloem (1887-1966), Slauerhoff (1898-1936), and Lucebert (1924-94). This sample formed the training set for the neural network program to analyse the unique letter patterns for each poet. Next, the program was fed a set of new poems, for which the author was to be identified. In choosing between two poets, the program succeeded in identifying the poet correctly for 80-90% of the new poems. When the choice was between three poets, the score was ∼70% correct. Since raw ASCII files are sufficient as input, and human pre-coding is unnecessary, neural network analysis of letter sequences may turn out to be a powerful tool in categorization and identification problems, such as genre, stylistics, and plagiarism.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.