Abstract

As stock trading has become a popular topic on Twitter, many researchers have proposed approaches to predicting market behavior from the emotions expressed in messages. However, detailed studies require a reasonably sized corpus with properly annotated emotions. In this work, we introduce a corpus of tweets in Brazilian Portuguese annotated with emotions. Comprising 4,277 tweets, it is, to the best of our knowledge, the largest annotated corpus available in the stock market domain for this language. Among its possible uses, the corpus lends itself to training machine learning models for automatic emotion identification and to studying correlations between emotions and stock price movements.

