Abstract

Online anonymity is considered as one of the great gifts of the Internet, but it also brings dangers to society, such as cybercrime, online sexual abuse and bullying, and love scams. Many people are fond of chatting online to make new friends, but how can they be sure that the person sitting behind the other computer is really the person they claim to be? By studying stylometry and keystroke dynamics features from chat data, it is feasible to reveal the actual gender of an online user. In this paper, we examined stylometry and keystroke dynamics features from chat data, and proposed a Random Forest based gender prediction approach by analyzing these features. In order to evaluate the effectiveness of the proposed approach, a data acquisition was conducted to capture the keystroke dynamics and text information when participants were chatting remotely via Skype. All participants were invited to chat freely on any topic they prefered in order to get to know each other. Based on our experimental result, the proposed approach achieved 72% prediction accuracy by analyzing on this free-text data captured only in 15 minutes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call