Abstract

In the paper, we propose an approach to the analysis of social groups and their relative positioning based on the identification of semantic differences in texts presented in the form of frequency dictionaries. The initial textual data was obtained by collecting records of thematic Internet communities. To collect entries, we implemented a specialized software module for downloading and analyzing posts as well as comments from open communities of interest in the social network VKontakte. The developed algorithm of frequency dictionary compilation evaluates the characteristics of data collected from social networks. For keywords identification, we propose a new approach based on the analysis of word frequency distribution, using methods for dimension reduction of feature spaces. The presented algorithm using the principal component analysis allowed to assess the significance of words by coefficients of the linear transformation. Along with the keywords, we identified semantic differences of social network communities and estimated their relative positioning in the transformed feature space.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.