Transformers and meta-tokenization in sentiment analysis for software engineering

Nathan Cassee,Andrei Agaronian,Eleni Constantinou,Nicole Novielli,Alexander Serebrenik

doi:10.1007/s10664-024-10468-2

Abstract

Sentiment analysis has been used to study aspects of software engineering, such as issue resolution, toxicity, and self-admitted technical debt. To address the peculiarities of software engineering texts, sentiment analysis tools often consider the specific technical lingo practitioners use. To further improve the application of sentiment analysis, there have been two recommendations: Using pre-trained transformer models to classify sentiment and replacing non-natural language elements with meta-tokens. In this work, we benchmark five different sentiment analysis tools (two pre-trained transformer models and three machine learning tools) on 2 gold-standard sentiment analysis datasets. We find that pre-trained transformers outperform the best machine learning tool on only one of the two datasets, and that even on that dataset the performance difference is a few percentage points. Therefore, we recommend that software engineering researchers should not just consider predictive performance when selecting a sentiment analysis tool because the best-performing sentiment analysis tools perform very similarly to each other (within 4 percentage points). Meanwhile, we find that meta-tokenization does not improve the predictive performance of sentiment analysis tools. Both of our findings can be used by software engineering researchers who seek to apply sentiment analysis tools to software engineering data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Transformers and meta-tokenization in sentiment analysis for software engineering

Abstract

Talk to us

Similar Papers

More From: Empirical Software Engineering

Lead the way for us

Journal: Empirical Software Engineering	Publication Date: Jun 3, 2024
License type: CC BY 4.0

Similar Papers

On negative results when using sentiment analysis tools for software engineering research
Robbert Jongeling ... Subhajit Datta
Empirical Software Engineering | VOL. 22
Robbert Jongeling, et. al.Robbert Jongeling ... Subhajit Datta
10 Jan 2017
Empirical Software Engineering | VOL. 22

Sentiment analysis for software engineering
Bin Lin ... Fiorella Zampetti
-
Bin Lin, et. al.Bin Lin ... Fiorella Zampetti
27 May 2018
27 May 2018

Choosing your weapons: On sentiment analysis tools for software engineering research
Robbert Jongeling ... Subhajit Datta
-
Robbert Jongeling, et. al.Robbert Jongeling ... Subhajit Datta
01 Sep 2015
01 Sep 2015

Development and Application of Sentiment Analysis Tools in Software Engineering: A Systematic Literature Review
Martin Obaidi ... Jil Klünder
-
Martin Obaidi, et. al.Martin Obaidi ... Jil Klünder
21 Jun 2021
21 Jun 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Transformers and meta-tokenization in sentiment analysis for software engineering

Abstract

Talk to us

Similar Papers

More From: Empirical Software Engineering