Abstract

The user demographic prediction problem is one of the critical processes in the construction of user profiles, which is of great significance for understanding users’ characteristics and attributes. Most of the prior works on this problem either used only single-source data or employed a hard-matching method to handle multi-source data. These methods will result in a great loss of data and information in many circumstances, which may affect the model’s accuracy as well as the application scenarios. In order to solve these problems, this paper proposes a framework for user demographic prediction based on mobile and survey data, and presents a Deep Structured Fusion Model (DSFM) using neural networks with attention mechanisms to perform data fusion by comparing user similarity between two heterogeneous datasets. We examine the effectiveness of the framework and the fusion model on a real-world mobile dataset with almost one billion users, using a survey dataset containing 29,809 users’ questionnaire results as an additional information source to predict users’ age and gender. Our framework achieves excellent results on these datasets, increasing the prediction accuracy of gender and age by up to 3.23% and 5.21% compared to the best baseline model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.