Abstract
Despite differences in the way that men and women experience goods and communicate their perspectives, online review communities typically do not provide participants' gender. We propose to infer author gender, given a set of reviews of a particular item, and experiment on reviews posted at the Internet Movie Database (IMDb). Using logistic regression, we explore the contribution of three types of information: 1) style, 2) content, and 3) metadata (e.g. review age, social feedback). Our results concur with previous research, in that there are salient differences in writing style and content between reviews authored by men versus women. However, in comparison to literary or scientific texts, to which classification tasks are often applied, reviews are brief and occur within the context of an ongoing discourse. Therefore, to compensative for the brevity of reviews, content and stylistic features can be augmented with metadata. We find in particular that the perceived utility of a review is an important correlate of gender. The model incorporating all features has a classification accuracy of 73.7% and is not as sensitive to review length as are those based only on stylistic or content features.
Full Text
Topics from this Paper
Stylistic Features
Content Features
Internet Movie Database
Social Feedback
Scientific Texts
+ Show 5 more
Create a personalized feed of these topics
Get StartedTalk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Similar Papers
Jan 1, 2014
Jun 1, 2021
Knowledge and Information Systems
Sep 18, 2012
Journal of Computer-Aided Design & Computer Graphics
Jun 1, 2022
Applied Soft Computing
Nov 1, 2021
IEEE Transactions on Multimedia
Jan 1, 2023
Молодий вчений
Mar 31, 2021
Jul 26, 2021
Jun 4, 2023
IEEE Transactions on Cybernetics
Apr 1, 2023
2007 International Conference on Natural Language Processing and Knowledge Engineering
Oct 29, 2007
International Journal on Recent and Innovation Trends in Computing and Communication
Jul 13, 2023
Jan 1, 2017
Jul 1, 2022