Authorship Attribution Through Function Word Adjacency Networks

Santiago Segarra,Mark Eisen,Alejandro Ribeiro

doi:10.1109/tsp.2015.2451111

Abstract

A method for authorship attribution based on function word adjacency networks (WANs) is introduced. Function words are parts of speech that express grammatical relationships between other words but do not carry lexical meaning on their own. In the WANs in this paper, nodes are function words and directed edges stand in for the likelihood of finding the sink word in the ordered vicinity of the source word. WANs of different authors can be interpreted as transition probabilities of a Markov chain and are therefore compared in terms of their relative entropies. Optimal selection of WAN parameters is studied and attribution accuracy is benchmarked across a diverse pool of authors and varying text lengths. This analysis shows that, since function words are independent of content, their use tends to be specific to an author and that the relational data captured by function WANs is a good summary of stylometric fingerprints. Attribution accuracy is observed to exceed the one achieved by methods that rely on word frequencies alone. Further combining WANs with methods that rely on word frequencies alone, results in larger attribution accuracy, indicating that both sources of information encode different aspects of authorial styles.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Authorship Attribution Through Function Word Adjacency Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Signal Processing

Lead the way for us

Journal: IEEE Transactions on Signal Processing	Publication Date: Jun 17, 2014
Citations: 105

Similar Papers

An analysis of the Word Adjacency Network method—Part 2—A true understanding of the method
Pervez Rizvi
Digital Scholarship in the Humanities | VOL. 38
Pervez RizviPervez Rizvi
17 May 2022
Digital Scholarship in the Humanities | VOL. 38

AUTHORSHIP ATTRIBUTION OF RESPONSA USING CLUSTERING
Yaakov Hacohen-Kerner ... Orr Margaliot
Cybernetics and Systems | VOL. 45
Yaakov Hacohen-Kerner, et. al.Yaakov Hacohen-Kerner ... Orr Margaliot
18 Aug 2014
Cybernetics and Systems | VOL. 45

A Response to Rosalind Barber’s Critique of the Word Adjacency Method for Authorship Attribution
Santiago Segarra ... Alejandro Ribeiro
ANQ: A Quarterly Journal of Short Articles, Notes and Reviews | VOL. 34
Santiago Segarra, et. al.Santiago Segarra ... Alejandro Ribeiro
21 Jan 2020
ANQ: A Quarterly Journal of Short Articles, Notes and Reviews | VOL. 34

“I would I had that corporal soundness”: Pervez Rizvi's Analysis of the Word Adjacency Network Method of Authorship Attribution
Gabriel Egan ... Alejandro Ribeiro
Digital Scholarship in the Humanities | VOL. 38
Gabriel Egan, et. al.Gabriel Egan ... Alejandro Ribeiro
28 Apr 2023
Digital Scholarship in the Humanities | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Authorship Attribution Through Function Word Adjacency Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Signal Processing