Abstract

Category suggestions or recommendations for customers or users have become an essential feature for commerce or leisure websites. This is a growing topic that follows users’ activity in social networks generating a huge quantity of information about their interests, contacts, among many others. These data are usually collected to analyze people’s behavior, trends, and integrate a complete user profile. In this sense, we analyze a dataset collected from Pinterest to predict the gender and age by processing input images using a Convolutional Neural Network. Our method is based on the meaning of the image rather than the visual content. Additionally, we propose a heuristic-based approach for text analysis to predict users’ age and gender from Twitter. Both of the classifiers are based on text and images and they are compared with various similar approaches in the state of the art. Suggested categories are based on association rules conformed by the activity of thousands of users in order to estimate trends. Computer simulations showed that our approach can recommend interesting categories for a user analyzing his current interest and comparing this interest with similar users’ profiles or trends and, therefore, achieve an improved user profile. The proposed method is capable of predicting the user’s age with high accuracy, and at the same time, it is able to predict gender and category information from the user. The certainty that one or more suggested categories be interesting to people is higher for those users with a large number of publications.

Highlights

  • Multiple studies have been done in order to explore how to predict age, gender, authorship, region, and other characteristics of the author of a text

  • We discuss the results of the methodology that is described in Section 2.5 for text analysis with the PAN 2015 database DBT

  • Social networks like Twitter or Pinterest are essentially used for different purposes

Read more

Summary

Introduction

Multiple studies have been done in order to explore how to predict age, gender, authorship, region, and other characteristics of the author of a text. Several statistics, people similarities, and trends are computed from the user activity in social networks, including the author profiling, which remains as an open multifactor challenge In this context, the most prominent conference about author profiling (PAN at CLEF) has been studying, since 2013, these topics; it was until 2018 when images were included as part of the training sets to explore the age and gender identification using images and text information. This work focuses on the analysis of users’ publications in social networks, Pinterest and Twitter, with the aim of establishing a representative user model based on age and gender. We evaluate two different types of datasets (from Twitter and Pinterest) to compare text and image features performance separately for predicting an initial profile for each user in the dataset (age and gender). These rules are huge number of images different and categories setthe upassociation the association rules These rules generated by the Apriori algorithm applied to recommend additional categories to Pinterest users.

Method Description
General Methodology
Description of Pinterest Database
32. Women’s Fashion
Transfer Learning
Gender and Age Prediction by Text
User Profile and Category Suggestion Approach
Results for Age and Gender Prediction Using Images from Pinterest
Results for Gender and Age Prediction Using Text from Twitter
Trends Analysis and Category Recommendations
Discussion and Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.