Code-switching Data Research Articles

Classic linguistic models, such as Chomsky’s minimalist schematization of the human language faculty, were typically based on a ‘monolingual ideal’. More recently, models have been extended to bilingual cognition. For instance, MacSwan (2000) posited that bilingual individuals possess a single syntactic computational system and, crucially, two phonological systems. The current paper examines this possible architecture of the bilingual language faculty by utilizing code-switching data. Specifically, the natural speech of Maria, a habitual Spanish-English code-switcher from the Bangor Miami Corpus, was examined. For the interface of phonology, an analysis was completed on the frequency of syllabic structures used by Maria. Phonotactics were examined as the (unilingual) phonological systems of Spanish and English impose differential restrictions on the legality of complex onsets and codas. The results indicated that Maria’s language of use impacted the phonotactics of her speech, but that the context of use (unilingual or code-switched) did not. This suggests that Maria was alternating between encapsulated phonological systems when she was code-switching. For the interface of morphosyntax, syntactic dependencies within Maria’s code-switched speech and past literature were examined. The evidence illustrates that syntactic dependencies are indeed established within code-switched sentences, indicating that such constructions are derived from a single syntactic subset. Thus, the quantitative and qualitative results from this paper wholly support MacSwan’s original conjectures regarding the bilingual language faculty: bilingual cognition appears to be composed of a single computational system which builds multi-language syntactic structures, and more than one phonological system.

Read full abstract

It has been declared by the World Health Organization (WHO) the novel coronavirus a global pandemic due to an exponential spread in COVID-19 in the past months reaching over 100 million cases and resulting in approximately 3 million deaths worldwide. Amid this pandemic, identification of cyberbullying has become a more evolving area of research over posts or comments in social media platforms. In multilingual societies like India, code-switched texts comprise the majority of the Internet. Identifying the online bullying of the code-switched user is bit challenging than monolingual cases. As a first step towards enabling the development of approaches for cyberbullying detection, we developed a new code-switched dataset, collected from Twitter utterances annotated with binary labels. To demonstrate the utility of the proposed dataset, we build different machine learning (Support Vector Machine & Logistic Regression) and deep learning (Multilayer Perceptron, Convolution Neural Network, BiLSTM, BERT) algorithms to detect cyberbullying of English-Hindi (En-Hi) code-switched text. Our proposed model integrates different hand-crafted features and is enriched by sequential and semantic patterns generated by different state-of-the-art deep neural network models. Initial experimental results of the proposed deep ensemble model on our code-switched data reveal that our approach yields state-of-the-art results, i.e., 0.93 in terms of macro-averaged F1 score. The dataset and codes of the present study will be made publicly available on the paper’s companion repository [https://github.com/95sayanta/COVID-19-and-Cyberbullying].

Read full abstract

Code-switching Data Research Articles

Related Topics

Articles published on Code-switching Data

HC 2L: Hybrid and Cooperative Contrastive Learning for Cross-Lingual Spoken Language Understanding.

Implikasi Penggunaan Alih Kode Dalam Film “Ngeri-Ngeri Sedap” Terhadap Komunikasi Antar Remaja di Lingkungan Sekolah

Language Options in Food Product Advertising on Youtube

Deep Learning Approaches for English-Marathi Code-Switched Detection

ALIH KODE DAN CAMPUR KODE PADA DIALOG ANTARTOKOH FILM BUMI MANUSIA KARYA HANUNG BRAMANTYO

Code-switching input for machine translation: a case study of Vietnamese–English data

An Analysis of Code-Switching on The Utterances of Agnes Mo on Daniel Mananta Network's Youtube Channel

Types and Functions of Code-Switching in the Film Everything Everywhere All at Once

Alih Kode Pada Konten Vlog Dalam Kanal Youtube Turah Parthayana

A Review and Critical Analysis of Qualitative Methodologies and Data-Collection Techniques Used for Code-Switching Research

Kiswahili-English on Public Signage: A Morpheme- By -Morpheme Approach

The Analysis of the Sepedi-English Code-switched Radio News Corpus

ANALYSIS OF CODE-SWITCHING AND CODE-MIXING USED IN RINTIK SEDU YOUTUBE CHANNEL’S VIDEO

Heritage language maintenance and shift of three languages across three generations of Mountain Jews in Israel

Alih Kode Dan Campur Kode Dalam Video Blogger Suki Suka Japan

Building Educational Technologies for Code-Switching: Current Practices, Difficulties and Future Directions

Adapter-based fine-tuning of pre-trained multilingual language models for code-mixed and code-switched text classification

Code Mixing and Code Switching in Movie Murder on the Orient Express by Kenneth Branagh

Bilinguals have a single computational system but two compartmentalized phonological grammars: Evidence from code-switching

COVID-19 and cyberbullying: deep ensemble model to identify cyberbullying from code-switched languages during the pandemic.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Code-switching Data Research Articles

Related Topics

Articles published on Code-switching Data

HC 2L: Hybrid and Cooperative Contrastive Learning for Cross-Lingual Spoken Language Understanding.

Implikasi Penggunaan Alih Kode Dalam Film “Ngeri-Ngeri Sedap” Terhadap Komunikasi Antar Remaja di Lingkungan Sekolah

Language Options in Food Product Advertising on Youtube

Deep Learning Approaches for English-Marathi Code-Switched Detection

ALIH KODE DAN CAMPUR KODE PADA DIALOG ANTARTOKOH FILM BUMI MANUSIA KARYA HANUNG BRAMANTYO

Code-switching input for machine translation: a case study of Vietnamese–English data

An Analysis of Code-Switching on The Utterances of Agnes Mo on Daniel Mananta Network's Youtube Channel

Types and Functions of Code-Switching in the Film Everything Everywhere All at Once

Alih Kode Pada Konten Vlog Dalam Kanal Youtube Turah Parthayana

A Review and Critical Analysis of Qualitative Methodologies and Data-Collection Techniques Used for Code-Switching Research

Kiswahili-English on Public Signage: A Morpheme- By -Morpheme Approach

The Analysis of the Sepedi-English Code-switched Radio News Corpus

ANALYSIS OF CODE-SWITCHING AND CODE-MIXING USED IN RINTIK SEDU YOUTUBE CHANNEL’S VIDEO

Heritage language maintenance and shift of three languages across three generations of Mountain Jews in Israel

Alih Kode Dan Campur Kode Dalam Video Blogger Suki Suka Japan

Building Educational Technologies for Code-Switching: Current Practices, Difficulties and Future Directions

Adapter-based fine-tuning of pre-trained multilingual language models for code-mixed and code-switched text classification

Code Mixing and Code Switching in Movie Murder on the Orient Express by Kenneth Branagh

Bilinguals have a single computational system but two compartmentalized phonological grammars: Evidence from code-switching

COVID-19 and cyberbullying: deep ensemble model to identify cyberbullying from code-switched languages during the pandemic.