Abstract

As the internet has become part of everyday life, online social networks such as Facebook and Twitter have taken on a major role in networking, information dissemination and entertainment. As of 2017, Twitter reaches over 317M monthly active users who generate more than 320M tweets every day, making it one of the fastest media for spreading information in this era. To aid data distribution without overwhelming users with information, we develop a recommender system that focuses on a vital aspect of social media, the relationships among users, and addresses a common question: whom to follow or befriend? By choosing the right accounts and users to follow, a user can control their sources of information as desired. The information collected from a user's most recent tweets is used to find other users whose recent tweets contain similar information, subject to the constraint that the users share at least one mutual friend. By exploiting the continuous, real-time updating of data on social networks, we develop a method to ensure that our training sets consist of information relevant for classification, thus preserving accuracy while reducing training set sizes for probabilistic learning models. We use two algorithms to detect tweets on common topics, a Noun Phrase detector and a Naive Bayes Text (Topic) Classifier, and compare their complexity and accuracy. The Naive Bayes Classifier, despite being probabilistic, performed well with a relatively small training set. This exception applies to Twitter because it is a framework that updates in real time. Exact matches were hard to obtain with the Noun Phrase detector, because we go only one level deep owing to limited compute. However, when matches were found, they were up to 90% accurate. Experiments on tweets of random public users show that a Naive Bayes Classifier with a small but recent training data set can work as well as or better than a Collaborative filter, without the assumptions of the Collaborative model.
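
The pipeline summarized above (classify recent tweets by topic, then suggest users with a matching topic and at least one mutual friend) can be illustrated with a minimal sketch. This is not the authors' implementation: the multinomial Naive Bayes with add-one smoothing, the tokenizer, the "dominant topic" rule, the tiny training set, and the names `NaiveBayesTopicClassifier`, `dominant_topic`, `recommend`, `recent_tweets` and `friends` are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumed preprocessing: lowercase, keep alphabetic tokens only.
    return "".join(c.lower() if c.isalpha() else " " for c in text).split()


class NaiveBayesTopicClassifier:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.word_counts = defaultdict(Counter)  # topic -> word frequencies
        self.doc_counts = Counter(labels)        # topic -> number of tweets
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        self.total_docs = len(labels)
        return self

    def predict(self, text):
        tokens = tokenize(text)
        best_label, best_score = None, -math.inf
        for label in self.doc_counts:
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(self.doc_counts[label] / self.total_docs)
            total = sum(self.word_counts[label].values())
            for tok in tokens:
                count = self.word_counts[label][tok]
                score += math.log((count + 1) / (total + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def dominant_topic(clf, tweets):
    # Assumed rule: the most frequent predicted topic among recent tweets.
    return Counter(clf.predict(t) for t in tweets).most_common(1)[0][0]


def recommend(clf, user, recent_tweets, friends):
    """Suggest users with the same dominant topic and a mutual friend."""
    target = dominant_topic(clf, recent_tweets[user])
    suggestions = []
    for other in recent_tweets:
        if other == user or other in friends[user]:
            continue  # skip the user and accounts already followed
        if not (friends[user] & friends[other]):
            continue  # require at least one mutual friend
        if dominant_topic(clf, recent_tweets[other]) == target:
            suggestions.append(other)
    return sorted(suggestions)


# Hypothetical, deliberately small and "recent" training set.
clf = NaiveBayesTopicClassifier().fit(
    [
        "the team won the football match",
        "great goal in the final game",
        "new phone released with faster processor",
        "software update improves the laptop battery",
    ],
    ["sports", "sports", "tech", "tech"],
)

# Hypothetical users, recent tweets and friend sets.
recent_tweets = {
    "alice": ["what a goal by the team"],
    "bob": ["the final match was great"],
    "carol": ["faster processor in the new laptop"],
    "dave": ["football game tonight"],
}
friends = {
    "alice": {"erin"},
    "bob": {"erin"},
    "carol": {"erin"},
    "dave": {"frank"},
}

print(recommend(clf, "alice", recent_tweets, friends))
```

In this toy data, only `bob` both tweets about the same topic as `alice` and shares a friend with her. `carol` shares a friend but tweets about a different topic, and `dave` has no mutual friend. In practice the training set would be refreshed from recent tweets, in line with the abstract's point about keeping it small but current.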
