Abstract

The rapid expansion of social media platforms has made linking user profiles across various networks an essential aspect of maintaining a consistent identity. With 4.66 billion users reported to be in the Websphere, many are active on multiple social media platforms simultaneously. Identifying users across multiple platforms poses challenges in integrating user profiles from various sources. Different matching schemes have been suggested over the years based on different user profile features, but very little information has been uncovered about user-generated text as a unique attribute for user profile matching, which generally poses real challenges in real-world scenarios. As many users have insufficient text and the use of non-discrete text information makes the comparison operation between the two social networks of quadratic complexity. Our study examines the different existing literature schemes for matching user profile pairs based only on their generated textual content. We suggest and evaluate the effectiveness of a two stage matching approach based on Locality Sensitive Hashing clustering and nearest neighbor search. We also present other matching results of different user representations language models and matching schemes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call