Title Zero-shot emotion detection for semi-supervised sentiment analysis using sentence transformers and ensemble learning /
Authors Gebremichael Tesfagergish, Senait ; Kapočiūtė-Dzikienė, Jurgita ; Damaševičius, Robertas
DOI 10.3390/app12178662
Full Text Download
Is Part of Applied sciences.. Basel : MDPI. 2022, vol. 12, iss. 17, art. no. 8662, p. 1-19.. ISSN 2076-3417
Keywords [eng] sentiment analysis ; emotion detection ; sentence transformers ; zero-shot model ; ensemble learning ; natural language processing
Abstract [eng] We live in a digitized era where our daily life depends on using online resources. Businesses consider the opinions of their customers, while people rely on the reviews/comments of other users before buying specific products or services. These reviews/comments are usually provided in the non-normative natural language within different contexts and domains (in social media, forums, news, blogs, etc.). Sentiment classification plays an important role in analyzing such texts collected from users by assigning positive, negative, and sometimes neutral sentiment values to each of them. Moreover, these texts typically contain many expressed or hidden emotions (such as happiness, sadness, etc.) that could contribute significantly to identifying sentiments. We address the emotion detection problem as part of the sentiment analysis task and propose a two-stage emotion detection methodology. The first stage is the unsupervised zero-shot learning model based on a sentence transformer returning the probabilities for subsets of 34 emotions (anger, sadness, disgust, fear, joy, happiness, admiration, affection, anguish, caution, confusion, desire, disappointment, attraction, envy, excitement, grief, hope, horror, joy, love, loneliness, pleasure, fear, generosity, rage, relief, satisfaction, sorrow, wonder, sympathy, shame, terror, and panic). The output of the zero-shot model is used as an input for the second stage, which trains the machine learning classifier on the sentiment labels in a supervised manner using ensemble learning. The proposed hybrid semi-supervised method achieves the highest accuracy of 87.3% on the English SemEval 2017 dataset.
Published Basel : MDPI
Type Journal article
Language English
Publication date 2022
CC license CC license description