The advent of global digital communication has accentuated the complexity of multilingual code-switching, presenting formidable challenges for sentiment analysis by necessitating nuanced interpretation of linguistic and socio-pragmatic cues. This research introduces MultiSwitchNet, a pioneering sentiment analysis model tailored for the intricate dynamics of Dravidian languages mixed with English code-switched texts. MultiSwitchNet leverages the linguistic prowess of XLM-RoBERTa, combined with socio-pragmatic insights through a novel Socio-Pragmatic Embedding Layer (SPEL) utilizing DistilBERT. Evaluated on the Dravidian Code-Mixed Dataset, MultiSwitchNet achieved superior performance metrics, including an accuracy of 72.3%, precision of 72.9%, recall of 72.1%, and an F1-score of 72.5%. Its effectiveness extends across varying degrees of code-switching complexity and demonstrates remarkable consistency in sentiment prediction across semantically similar yet syntactically diverse sentences. Statistical analyses, including ANOVA and post hoc tests, underscore the model’s significant outperformance compared to both baseline and state-of-the-art models, highlighting the indispensability of integrating socio-pragmatic awareness for accurate sentiment analysis. MultiSwitchNet’s innovative approach sets a new benchmark for sentiment analysis in multilingual contexts and opens avenues for future research into models capable of discerning the rich socio-pragmatic fabric of global digital communication.
Read full abstract