A Comparison of Term Weighting Schemes for Text Classification and Sentiment Analysis with a Supervised Variant of tf.idf

Giacomo Domeniconi, Claudio Sartori, Roberto Pasolini, Gianluca Moro

doi:10.1007/978-3-319-30162-4_4

Abstract

In text analysis tasks such as text classification and sentiment analysis, the choice of term weighting scheme can have a substantial impact on effectiveness. Classic unsupervised schemes are based solely on the distribution of terms across documents, while newer supervised ones exploit the known category membership of the training documents; the latter are often tailored specifically to either topic or sentiment classification. We propose here a supervised variant of the well-known tf.idf scheme, where the idf factor is computed without considering documents within the category under analysis, so that terms appearing frequently only within that category are not penalized. The importance of these terms is further boosted in a second variant inspired by relevance frequency. We performed extensive experiments to compare these novel schemes with known ones, observing top performance in text categorization by topic and satisfactory results in sentiment classification.
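
The core idea of the first variant, an idf factor computed only over documents outside the category under analysis, can be illustrated with a short sketch. The abstract does not give the exact formula, so the smoothed log ratio below, the function names, and the toy corpus are all assumptions made for illustration and are not the authors' definition. The second, relevance-frequency-inspired variant is not covered here.

```python
import math
from collections import Counter


def standard_idf(docs):
    """Classic unsupervised idf: log(N / df), over all documents."""
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))
    return {t: math.log(n / df[t]) for t in df}


def idf_excluding_category(docs, labels, category):
    """Hypothetical supervised idf: document frequencies are counted only
    over documents that do NOT belong to `category`.

    Assumed smoothing (not taken from the abstract):
    log((N_out + 1) / max(1, df_out)).
    """
    outside = [set(d) for d, y in zip(docs, labels) if y != category]
    df_out = Counter(t for d in outside for t in d)
    n_out = len(outside)
    vocab = {t for d in docs for t in d}
    return {t: math.log((n_out + 1) / max(1, df_out[t])) for t in vocab}


def weight_document(doc, idf):
    """Raw term frequency multiplied by the supplied idf factor."""
    tf = Counter(doc)
    return {t: tf[t] * idf.get(t, 0.0) for t in tf}


# Toy corpus (invented for illustration only).
docs = [
    ["goal", "match", "team"],
    ["goal", "team", "score"],
    ["election", "vote", "team"],
    ["vote", "party"],
]
labels = ["sport", "sport", "politics", "politics"]

plain = standard_idf(docs)
sport_idf = idf_excluding_category(docs, labels, "sport")
weights = weight_document(docs[0], sport_idf)
```

In this toy corpus, "goal" and "vote" each occur in two of the four documents, so the classic idf treats them identically. With documents of the "sport" category excluded, "goal" never appears in the remaining documents and keeps a high factor, whereas "vote" appears in all of them and is down-weighted.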
