Abstract
The goal of Twitter users and tweets discovery for the target area is to find resident users and tweets created in the area. Previous works usually utilize following relationships among users to discover new users, whose precision and speed are easily affected by fake followers and the rate-limiting of getting Twitter data. In this manuscript, an algorithm for users and tweets discovery based on mention relationship strength and local tweet ratio (MRS-LTR for short) is proposed. First, the initial tweets are obtained by using Twitter API, from which the seed users and the seed tweets are extracted, and the mention relationship graph is constructed. Then, two sound-reasonable hypotheses about the location information contained in mention relationship and user’s tweets are explored and verified on the public dataset. Next, based on the two hypotheses, three new recommendation indicators are proposed to recommend new users in the area from the users mentioned by seed users. Finally, the candidate user location verification method based on profile and iteration probability is used to expand the seed user set. The principle analysis of discovery precision, discovery speed and algorithm function shows the effectiveness of MRS-LTR. Experimental results on a dataset of 4.4M users and 112M tweets demonstrate that MRS-LTR outperforms the state-of-the-art algorithm. The precision infimum and modified precision are increased by 14.79% and 31.22%, respectively. Meanwhile, the discovery speed is significantly improved, and the number of users discovered in the same period is about 8.7 times that of the existing algorithm.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.