Abstract
Recent years have seen an increasing attention to social aspects of software engineering, including studies of emotions and sentiments experienced and expressed by the software developers. Most of these studies reuse existing sentiment analysis tools such as SentiStrength and NLTK. However, these tools have been trained on product reviews and movie reviews and, therefore, their results might not be applicable in the software engineering domain. In this paper we study whether the sentiment analysis tools agree with the sentiment recognized by human evaluators (as reported in an earlier study) as well as with each other. Furthermore, we evaluate the impact of the choice of a sentiment analysis tool on software engineering studies by conducting a simple study of differences in issue resolution times for positive, negative and neutral texts. We repeat the study for seven datasets (issue trackers and Stack Overflow questions) and different sentiment analysis tools and observe that the disagreement between the tools can lead to diverging conclusions. Finally, we perform two replications of previously published studies and observe that the results of those studies cannot be confirmed when a different sentiment analysis tool is used.
Highlights
Sentiment analysis is “the task of identifying positive and negative opinions, emotions, and evaluations” (Wilson et al 2005)
We observe that NLTK and SENTISTRENGTH agree only on one relation for the ANDROID, i.e., that issues with the neutral sentiment tend to be resolved more slowly than issues formulated in a more positive way
We observe that for GNOME and Apache Software Foundation (ASF) the tools agree that the issues with the neutral sentiment are resolved faster than issues with the positive sentiment, i.e., the results for GNOME and ASF are opposite from those for ANDROID
Summary
Sentiment analysis is “the task of identifying positive and negative opinions, emotions, and evaluations” (Wilson et al 2005). The spectrum of sentiment analysis techniques ranges from identifying polarity (positive or negative) to a complex computational treatment of subjectivity, opinion and sentiment (Pang and Lee 2007). With the proliferation of collaborative development environments, discussion between developers are recorded and archived to an extent that could not be conceived before. The availability of such discussion materials makes it easy to study whether and how the sentiments expressed by software developers influence the outcome of development activities. With this background, we apply sentiment polarity analysis to several software development ecosystems in this study
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have