HindiPersonalityNet: Personality Detection in Hindi Conversational Data Using Deep Learning with Static Embedding

Akshi Kumar, Rohit Beniwal, Dipika Jain

doi:10.1145/3625228

Abstract

Personality detection, together with other behavioral and cognitive assessments, can help explain why people act the way they do and is useful in online applications such as recommender systems, job screening, matchmaking, and counseling. Psychometric natural language processing relies on textual cues and distinctive markers of writing style in conversational utterances, which reveal signs of individual personality. This work presents a text-based deep neural model, HindiPersonalityNet, that classifies conversations into three personality categories (ambivert, extrovert, introvert) to detect personality in Hindi conversational data. The model uses a gated recurrent unit with BioWordVec embeddings for text classification and is trained and tested on a novel dataset, शख्सियत (pronounced Shakhsiyat), curated from dialogues in the Indian crime-thriller drama series Aarya. The model achieves an F1-score of 0.701 and shows the potential of conversational data from various sources for understanding and predicting a person's personality traits. It captures both semantic and long-distance dependencies in conversations and establishes the effectiveness of our dataset as a benchmark for personality detection in Hindi dialogue data. Further, a comprehensive comparison of static and dynamic word embeddings is carried out on our standardized dataset to determine the most suitable embedding method for personality detection.
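As a rough illustration of the kind of pipeline the abstract describes (static word vectors fed token by token into a gated recurrent unit, followed by a three-way softmax), here is a minimal pure-Python sketch. It is not the authors' implementation: the class name `TinyGRUClassifier`, the dimensions, the omission of bias terms, and the randomly initialised weights and embedding table are all assumptions made for illustration, and the abstract does not specify the actual layer sizes or training setup. In practice the embedding table would be loaded from pretrained static vectors and the weights would be learned from labelled data.

```python
import math
import random

# The three personality categories named in the abstract.
LABELS = ["ambivert", "extrovert", "introvert"]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class TinyGRUClassifier:
    """Hypothetical, untrained GRU classifier over static embeddings."""

    def __init__(self, vocab, embed_dim=8, hidden_dim=6, seed=0):
        rng = random.Random(seed)
        # Static embeddings: one fixed vector per token, independent of context.
        # Random values stand in for pretrained vectors (assumption).
        self.embed = {t: [rng.uniform(-1, 1) for _ in range(embed_dim)] for t in vocab}
        self.unk = [0.0] * embed_dim  # out-of-vocabulary tokens map to zeros
        self.hidden_dim = hidden_dim
        # Input (W) and recurrent (U) weights for the update (z), reset (r),
        # and candidate (n) computations. Biases are omitted for brevity.
        self.W = {g: rand_matrix(hidden_dim, embed_dim, rng) for g in "zrn"}
        self.U = {g: rand_matrix(hidden_dim, hidden_dim, rng) for g in "zrn"}
        self.out = rand_matrix(len(LABELS), hidden_dim, rng)

    def step(self, x, h):
        """One GRU time step: returns the new hidden state."""
        z = [sigmoid(a + b) for a, b in zip(matvec(self.W["z"], x), matvec(self.U["z"], h))]
        r = [sigmoid(a + b) for a, b in zip(matvec(self.W["r"], x), matvec(self.U["r"], h))]
        reset_h = [ri * hi for ri, hi in zip(r, h)]
        n = [math.tanh(a + b)
             for a, b in zip(matvec(self.W["n"], x), matvec(self.U["n"], reset_h))]
        # Interpolate between the previous state and the candidate state.
        return [(1 - zi) * ni + zi * hi for zi, ni, hi in zip(z, n, h)]

    def predict_proba(self, tokens):
        h = [0.0] * self.hidden_dim
        for tok in tokens:
            h = self.step(self.embed.get(tok, self.unk), h)
        logits = matvec(self.out, h)
        peak = max(logits)  # subtract the max for a numerically stable softmax
        exps = [math.exp(v - peak) for v in logits]
        total = sum(exps)
        return [e / total for e in exps]

    def predict(self, tokens):
        probs = self.predict_proba(tokens)
        return LABELS[probs.index(max(probs))]


# Usage with a toy, pre-tokenised Hindi utterance. The weights are untrained,
# so the predicted label is arbitrary.
model = TinyGRUClassifier(vocab=["मैं", "ठीक", "हूँ"])
print(model.predict(["मैं", "ठीक", "हूँ"]))
```

The final hidden state summarises the whole utterance, which is how a recurrent model of this kind can carry information across distant tokens. A trained version would fit the weights by minimising cross-entropy over the three classes.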

More From: ACM Transactions on Asian and Low-Resource Language Information Processing
