Discovering Fine-grained Sentiment in Suicide Notes

Wenbo Wang,Amit P Sheth,Ming Tan,Shaojun Wang,Lu Chen

doi:10.4137/bii.s8963

Wenbo Wang, Amit P Sheth + Show 3 more

Open Access

https://doi.org/10.4137/bii.s8963

Copy DOI

Journal: Biomedical Informatics Insights	Publication Date: Jan 1, 2012
Citations: 19	License type: cc-by-nc

Affiliation: Wright State University

Abstract

This paper presents our solution for the i2b2 sentiment classification challenge. Our hybrid system consists of machine learning and rule-based classifiers. For the machine learning classifier, we investigate a variety of lexical, syntactic and knowledge-based features, and show how much these features contribute to the performance of the classifier through experiments. For the rule-based classifier, we propose an algorithm to automatically extract effective syntactic and lexical patterns from training examples. The experimental results show that the rule-based classifier outperforms the baseline machine learning classifier using unigram features. By combining the machine learning classifier and the rule-based classifier, the hybrid system gains a better trade-off between precision and recall, and yields the highest micro-averaged F-measure (0.5038), which is better than the mean (0.4875) and median (0.5027) micro-average F-measures among all participating teams.

Full Text