Deep Neural Language Models Research Articles

A sentence is more than the sum of its words: its meaning depends on how they combine with one another. The brain mechanisms underlying such semantic composition remain poorly understood. To shed light on the neural vector code underlying semantic composition, we introduce two hypotheses: (1) the intrinsic dimensionality of the space of neural representations should increase as a sentence unfolds, paralleling the growing complexity of its semantic representation; and (2) this progressive integration should be reflected in ramping and sentence-final signals. To test these predictions, we designed a dataset of closely matched normal and jabberwocky sentences (composed of meaningless pseudo words) and displayed them to deep language models and to 11 human participants (5 men and 6 women) monitored with simultaneous MEG and intracranial EEG. In both deep language models and electrophysiological data, we found that representational dimensionality was higher for meaningful sentences than jabberwocky. Furthermore, multivariate decoding of normal versus jabberwocky confirmed three dynamic patterns: (1) a phasic pattern following each word, peaking in temporal and parietal areas; (2) a ramping pattern, characteristic of bilateral inferior and middle frontal gyri; and (3) a sentence-final pattern in left superior frontal gyrus and right orbitofrontal cortex. These results provide a first glimpse into the neural geometry of semantic integration and constrain the search for a neural code of linguistic composition.SIGNIFICANCE STATEMENT Starting from general linguistic concepts, we make two sets of predictions in neural signals evoked by reading multiword sentences. First, the intrinsic dimensionality of the representation should grow with additional meaningful words. Second, the neural dynamics should exhibit signatures of encoding, maintaining, and resolving semantic composition. We successfully validated these hypotheses in deep neural language models, artificial neural networks trained on text and performing very well on many natural language processing tasks. Then, using a unique combination of MEG and intracranial electrodes, we recorded high-resolution brain data from human participants while they read a controlled set of sentences. Time-resolved dimensionality analysis showed increasing dimensionality with meaning, and multivariate decoding allowed us to isolate the three dynamical patterns we had hypothesized.

Read full abstract

BackgroundThere is a limited amount of data on the safety profile of the COVID-19 vector vaccine Gam-COVID-Vac (Sputnik V). Previous infodemiology studies showed that social media discourse could be analyzed to assess the most concerning adverse events (AE) caused by drugs.ObjectiveWe aimed to investigate mild AEs of Sputnik V based on a participatory trial conducted on Telegram in the Russian language. We compared AEs extracted from Telegram with other limited databases on Sputnik V and other COVID-19 vaccines. We explored symptom co-occurrence patterns and determined how counts of administered doses, age, gender, and sequence of shots could confound the reporting of AEs.MethodsWe collected a unique dataset consisting of 11,515 self-reported Sputnik V vaccine AEs posted on the Telegram group, and we utilized natural language processing methods to extract AEs. Specifically, we performed multilabel classifications using the deep neural language model Bidirectional Encoder Representations from Transformers (BERT) “DeepPavlov,” which was pretrained on a Russian language corpus and applied to the Telegram messages. The resulting area under the curve score was 0.991. We chose symptom classes that represented the following AEs: fever, pain, chills, fatigue, nausea/vomiting, headache, insomnia, lymph node enlargement, erythema, pruritus, swelling, and diarrhea.ResultsTelegram users complained mostly about pain (5461/11,515, 47.43%), fever (5363/11,515, 46.57%), fatigue (3862/11,515, 33.54%), and headache (2855/11,515, 24.79%). Women reported more AEs than men (1.2-fold, P<.001). In addition, there were more AEs from the first dose than from the second dose (1.1-fold, P<.001), and the number of AEs decreased with age (β=.05 per year, P<.001). The results also showed that Sputnik V AEs were more similar to other vector vaccines (132 units) than with messenger RNA vaccines (241 units) according to the average Euclidean distance between the vectors of AE frequencies. Elderly Telegram users reported significantly more (5.6-fold on average) systemic AEs than their peers, according to the results of the phase 3 clinical trials published in The Lancet. However, the AEs reported in Telegram posts were consistent (Pearson correlation r=0.94, P=.02) with those reported in the Argentinian postmarketing AE registry.ConclusionsAfter the Sputnik V vaccination, Russian Telegram users reported mostly pain, fever, and fatigue. The Sputnik V AE profile was comparable with other vector COVID-19 vaccines. Discussion on social media could provide meaningful information about the AE profile of novel vaccines.

Read full abstract

Deep Neural Language Models Research Articles

Related Topics

Articles published on Deep Neural Language Models

SentinelLMs: Encrypted Input Adaptation and Fine-Tuning of Language Models for Private and Secure Inference

Dimensionality and Ramping: Signatures of Sentence Integration in the Dynamics of Brains and Deep Language Models.

Cross-Domain Sentiment Analysis Based on Small in-Domain Fine-Tuning

Improving the robustness and accuracy of biomedical language models through adversarial training

Cheap talk and cherry-picking: What ClimateBert has to say on corporate climate risk disclosures

Comparison of Structural Parsers and Neural Language Models as Surprisal Estimators.

Mild Adverse Events of Sputnik V Vaccine in Russia: Social Media Content Analysis of Telegram via Deep Learning.

Evaluation of taxonomic and neural embedding methods for calculating semantic similarity

Information Retrieval in an Infodemic: The Case of COVID-19 Publications.

Augmenting commit classification by using fine-grained source code changes and a pre-trained deep neural language model

Sense representations for Portuguese: experiments with sense embeddings and deep neural language models

Towards Improved Classification Accuracy on Highly Imbalanced Text Dataset Using Deep Neural Language Models

Cheap Talk and Cherry-Picking: What ClimateBert has to say on Corporate Climate Risk Disclosures

GSAM: A deep neural network model for extracting computational representations of Chinese addresses fused with geospatial feature

Can Machines Tell Stories? A Comparative Study of Deep Neural Language Models and Metrics

Bilateral neural embedding for collaborative filtering-based multimedia recommendation

Mining e-cigarette adverse events in social media using Bi-LSTM recurrent neural network with word embedding representation.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Deep Neural Language Models Research Articles

Related Topics

Articles published on Deep Neural Language Models

SentinelLMs: Encrypted Input Adaptation and Fine-Tuning of Language Models for Private and Secure Inference

Dimensionality and Ramping: Signatures of Sentence Integration in the Dynamics of Brains and Deep Language Models.

Cross-Domain Sentiment Analysis Based on Small in-Domain Fine-Tuning

Improving the robustness and accuracy of biomedical language models through adversarial training

Cheap talk and cherry-picking: What ClimateBert has to say on corporate climate risk disclosures

Comparison of Structural Parsers and Neural Language Models as Surprisal Estimators.

Mild Adverse Events of Sputnik V Vaccine in Russia: Social Media Content Analysis of Telegram via Deep Learning.

Evaluation of taxonomic and neural embedding methods for calculating semantic similarity

Information Retrieval in an Infodemic: The Case of COVID-19 Publications.

Augmenting commit classification by using fine-grained source code changes and a pre-trained deep neural language model

Sense representations for Portuguese: experiments with sense embeddings and deep neural language models

Towards Improved Classification Accuracy on Highly Imbalanced Text Dataset Using Deep Neural Language Models

Cheap Talk and Cherry-Picking: What ClimateBert has to say on Corporate Climate Risk Disclosures

GSAM: A deep neural network model for extracting computational representations of Chinese addresses fused with geospatial feature

Can Machines Tell Stories? A Comparative Study of Deep Neural Language Models and Metrics

Bilateral neural embedding for collaborative filtering-based multimedia recommendation

Mining e-cigarette adverse events in social media using Bi-LSTM recurrent neural network with word embedding representation.