Abstract
Social media has become a source of data from which analysts gain deeper insight into user behavior, trends, public opinion, and patterns of social media usage. Twitter is one of the most popular social media platforms, where users share short text messages, or "tweets". Twitter does not display user attributes such as gender, although users, knowingly or not, reveal this information in unstructured form. Gender is an important attribute in social media analytics, so this research was conducted to determine which data yield the most accurate gender classification. The purpose of the study was to determine whether combining data can improve the accuracy of gender classification on Twitter data, namely tweets and descriptions. The method used was word vector representation with word2vec, followed by a 2D Convolutional Neural Network (CNN) model. Word2vec generated word vector representations that take into account the context and meaning of words in the text. The 2D CNN model extracted features from these word vector representations and performed the gender classification. The research compared tweet data, description data, and a combination of tweets and descriptions to find which was the most accurate. The result of this study was that combined data between tweets and
More From: MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer