Abstract

Social network users often have little awareness of privacy protection and disclose private information in the content they post on social network platforms. To raise awareness of personal privacy protection among these users and help them understand the importance of protecting their private information, we propose a multi-dimensional sensitive information portrait model for social network users. The model uses the TF-IDF algorithm, based on the bag-of-words model, to calculate the sensitivity of sensitive information, classifies sensitive information into high, medium, and low sensitivity levels according to its importance to users, and builds a multi-dimensional sensitive information portrait of group users. We construct two sensitive information dictionaries and combine an improved FlashText algorithm with a regular expression string matching algorithm and the sure inverse order circular view matching algorithm to extract sensitive information from the basic information of social network users and from the historical data they have posted. A multi-dimensional sensitive information portrait of users is then built from the extracted sensitive information and its sensitivity, and users can replace sensitive information according to their needs to protect their privacy. In experimental evaluation, our scheme achieves an accuracy of 93.63% for the extraction of sensitive information.
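As a rough illustration of the extraction and replacement steps described above, the following minimal sketch combines a keyword dictionary with regular expressions and then masks hits at chosen sensitivity levels. It is not the paper's implementation: plain regex keyword lookup stands in for the improved FlashText algorithm, and the dictionary entries, patterns, sensitivity levels, and the function names `extract_sensitive` and `mask` are all hypothetical assumptions made for the example.

```python
import re

# Hypothetical dictionary: keyword -> (category, sensitivity level).
SENSITIVE_TERMS = {
    "home address": ("location", "high"),
    "hospital": ("health", "medium"),
    "birthday": ("identity", "low"),
}

# Hypothetical patterns for structured sensitive data.
PATTERNS = {
    "email": (re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"), "high"),
    "phone": (re.compile(r"\b\d{3}-\d{4}-\d{4}\b"), "high"),
}


def extract_sensitive(text):
    """Return hits as (start, end, matched_text, category, level), sorted by position."""
    hits = []
    # Dictionary lookup: whole-word, case-insensitive keyword matching.
    for term, (category, level) in SENSITIVE_TERMS.items():
        pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
        for m in pattern.finditer(text):
            hits.append((m.start(), m.end(), m.group(), category, level))
    # Regular-expression lookup for structured items such as e-mail addresses.
    for category, (pattern, level) in PATTERNS.items():
        for m in pattern.finditer(text):
            hits.append((m.start(), m.end(), m.group(), category, level))
    return sorted(hits)


def mask(text, hits, levels=("high",)):
    """Replace hits at the chosen levels with a category placeholder.

    Hits are processed right to left so earlier offsets stay valid.
    Overlapping hits are not handled in this sketch.
    """
    out = text
    for start, end, _, category, level in sorted(hits, reverse=True):
        if level in levels:
            out = out[:start] + "[" + category + "]" + out[end:]
    return out


text = "Mail me at ann@example.com, my birthday is soon."
hits = extract_sensitive(text)
print(mask(text, hits))
# prints: Mail me at [email], my birthday is soon.
```

Passing `levels=("high", "low")` would also replace the low-sensitivity keyword, which mirrors the idea that users choose what to replace according to their needs.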
