User location discovery from social media is crucial for location-based services like emergency awareness and event monitoring. Existing approaches generally integrate user-generated text features and social relationships, but insufficiently explore location-specific features and geographically proximate relationships, leading to suboptimal accuracy. In this paper, we propose a Twitter user geolocation method based on location feature enhancement, to better capture the location characteristics in users’ tweets and social relationships. Specifically, a user tweet representation algorithm based on location feature separation (TwLS) is designed. By leveraging words’ location-aware weight matrix and pre-trained embeddings, TwLS calculates a tweet representation for each user in every location, explicitly indicating the relevance between users and various locations. Additionally, we develop the local celebrity discovery method (LocCel) to construct social networks by identifying and preserving geographically concentrated high-degree nodes while filtering noise. Thereby LocCel enhances local relationships and strengthens location-proximate connections within the user social network. Experiments on two real-world datasets show that our method outperforms seven baselines, improving user geolocation accuracy by 3.1% ∼ 8.1% and 1.8% ∼ 8.8%, while reducing median error by 22.2% ∼ 52.8% and 19.4% ∼ 50.7%, respectively.
Read full abstract