Abstract

Most social networks offer users to mark the current geolocation and link it to published posts. We propose a method for determining the general topic image of the area using such posts. Our method includes Data Splitting, Text Preprocessing, Record Filtration, Topic Modeling, segmentation, and aggregation. To demonstrate the approach, 5.46 million geotagged posts were collected for Saint Petersburg, Russia. As a result of the proposed method, after filtering, 2.65 million non-advertising posts are left, for which 10583 topics are found (4287 aggregated topics). In addition, the city is divided into clusters of attractions and districts based on data, which help to find the real points of interest of the city.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.