Abstract
Benford’s Law states that, in naturally occurring systems, the frequency of numbers’ first digits is not evenly distributed. Numbers beginning with a 1 occur roughly 30% of the time, and are six times more common than numbers beginning with a 9. We show that Benford’s Law applies to social and behavioral features of users in online social networks. Using social data from five major social networks (Facebook, Twitter, Google Plus, Pinterest, and LiveJournal), we show that the distribution of first significant digits of friend and follower counts for users in these systems follow Benford’s Law. The same is true for the number of posts users make. We extend this to egocentric networks, showing that friend counts among the people in an individual’s social network also follows the expected distribution. We discuss how this can be used to detect suspicious or fraudulent activity online and to validate datasets.
Highlights
Benford’s Law states that, in naturally occurring systems, the frequency of numbers’ first digits is not evenly distributed
We show that exceptions to this rule can uncover configurations within social media systems that lead to unexpected results
We found that the distribution of first significant digit (FSD) among friends in all five datasets closely followed the values expected from Benford’s Law, with one interesting exception: the Pinterest following relationship
Summary
Benford’s Law states that, in naturally occurring systems, the frequency of numbers’ first digits is not evenly distributed. The first significant digit (FSD) of users’ friend and follower counts on Facebook, Twitter, Google Plus, Pinterest, and LiveJournal all follow Benford’s Law. Users’ numbers of posts conform to Benford.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.