Word2vec Model Research Articles

The COVID-19 pandemic caused several million deaths worldwide. Development of anti-coronavirus drugs is thus urgent. Unlike conventional non-peptide drugs, antiviral peptide drugs are highly specific, easy to synthesize and modify, and not highly susceptible to drug resistance. To reduce the time and expense involved in screening thousands of peptides and assaying their antiviral activity, computational predictors for identifying anti-coronavirus peptides (ACVPs) are needed. However, few experimentally verified ACVP samples are available, even though a relatively large number of antiviral peptides (AVPs) have been discovered. In this study, we attempted to predict ACVPs using an AVP dataset and a small collection of ACVPs. Using conventional features, a binary profile and word2vec (W2V) word embeddings, we systematically explored five different machine learning methods: Transformer, Convolutional Neural Network, bidirectional Long Short-Term Memory, Random Forest (RF) and Support Vector Machine. Via exhaustive searches, we found that the RF classifier with W2V consistently achieved better performance on different datasets. The two main controlling factors were: (i) the dataset-specific W2V dictionary was generated from the training and independent test datasets instead of the widely used general UniProt proteome and (ii) a systematic search determined the optimal k-mer value in W2V, which provides greater discrimination between positive and negative samples. Our proposed method, named iACVP, therefore consistently provides better prediction performance than existing state-of-the-art methods. To assist experimentalists in identifying putative ACVPs, we implemented our model as a web server accessible via the following link: http://kurata35.bio.kyutech.ac.jp/iACVP.
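The k-mer step mentioned in the abstract can be illustrated with a minimal sketch of how peptide sequences might be turned into "words" before W2V training. The abstract does not say whether the k-mers overlap, so the stride-1 sliding window here is an assumption, and the function names and example peptides are hypothetical. Training the embedding and the RF classifier is left out.

```python
from collections import Counter


def kmer_tokens(sequence: str, k: int) -> list[str]:
    """Split a peptide sequence into overlapping k-mers (stride 1, an assumption)."""
    if k < 1:
        raise ValueError("k must be a positive integer")
    seq = sequence.strip().upper()
    # A sequence shorter than k yields no tokens.
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def build_vocabulary(sequences, k: int) -> Counter:
    """Count every k-mer across a dataset; this plays the role of a
    dataset-specific dictionary, as opposed to one built from a general proteome."""
    counts = Counter()
    for seq in sequences:
        counts.update(kmer_tokens(seq, k))
    return counts


if __name__ == "__main__":
    # Hypothetical toy peptides, not taken from the study's datasets.
    peptides = ["ACDEFG", "CDEFGH"]
    # A search over k would repeat tokenization (and downstream training) per value.
    for k in (1, 2, 3):
        vocab = build_vocabulary(peptides, k)
        print(k, len(vocab))
```

Each token list would then be passed as one "sentence" to a word-embedding trainer, with k chosen by comparing downstream classifier performance.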

Research on social network data and depression often requires manually labeling depressed and non-depressed users, which is time-consuming and labor-intensive. This study explores the relationship between social network data and depression, and aims to contribute to detecting and identifying depression. By collecting and analyzing college students' microblog data, the study develops a preliminary screening algorithm for suspected depression microblogs, based on depression keywords and semantic expansion, and proposes a comprehensive lexical method. The method forms a depression keyword table by constructing a basic keyword table and expanding it semantically with the word embedding model Word2Vec. The keyword table is then used to calculate the semantic similarity of each tested microblog and to decide whether it is a suspected depression microblog. Experimental results on a microblog dataset of college students show that the comprehensive lexical method outperforms the SDS questionnaire segmentation method and the expert lexical method in screening accuracy. The comprehensive lexical method can quickly and automatically screen out the small proportion of suspected depression microblogs from a large number of college students' microblogs. This can reduce the workload of expert annotation, improve annotation efficiency, and provide a suitable data-processing basis for the subsequent accurate identification (a classification problem) of patients with depression.
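The similarity-based screening step can be sketched roughly as follows. The abstract only says that "semantic similarity" is computed against the keyword table, so the cosine measure, the averaging of word vectors, the threshold, and the toy two-dimensional embeddings below are all assumptions made for illustration, not the study's actual procedure.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors; 0.0 if either is zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def mean_vector(words, embeddings):
    """Average the vectors of the words that have an embedding (assumed pooling)."""
    vectors = [embeddings[w] for w in words if w in embeddings]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def is_suspected(post_words, keywords, embeddings, threshold):
    """Flag a post if its mean vector is close enough to any keyword vector."""
    post_vec = mean_vector(post_words, embeddings)
    if post_vec is None:
        return False
    best = max(
        (cosine(post_vec, embeddings[k]) for k in keywords if k in embeddings),
        default=0.0,
    )
    return best >= threshold


# Hypothetical toy embeddings; a real system would load trained Word2Vec vectors.
EMBEDDINGS = {
    "sad": [1.0, 0.0],
    "hopeless": [0.9, 0.1],
    "tired": [0.8, 0.2],
    "lunch": [0.0, 1.0],
}
KEYWORDS = ["sad", "hopeless"]  # stands in for the expanded keyword table

if __name__ == "__main__":
    print(is_suspected(["tired", "hopeless"], KEYWORDS, EMBEDDINGS, threshold=0.8))
    print(is_suspected(["lunch"], KEYWORDS, EMBEDDINGS, threshold=0.8))
```

In practice the threshold would be tuned against labeled microblogs, since it trades the number of posts sent to experts against the risk of missing suspected cases.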


Articles published on Word2vec Model

iACVP: markedly enhanced identification of anti-coronavirus peptides using a dataset-specific word2vec model.

Lightweight IDS Framework Using Word Embeddings for In-Vehicle Network Security

Evaluation of clustering techniques on Urdu News head-lines: a case of short length text

Deep learning method for Chinese multisource point of interest matching

LSTM (Long Short Term Memory) for Sentiment COVID-19 Vaccine Classification on Twitter

Detecting review spammer groups based on generative adversarial networks

Imputation of missing time-activity data with long-term gaps: A multi-scale residual CNN-LSTM network model

URBAN FUNCTIONAL ZONE IDENTIFICATION BY CONSIDERING THE HETEROGENEOUS DISTRIBUTION OF POINTS OF INTERESTS

Text Similarity Measurement Method and Application of Online Medical Community Based on Density Peak Clustering

Agenda Dynamics on Social Media During COVID-19 Pandemic: Interactions Between Public, Media, and Government Agendas

Word2vec model and HNSW index

Research on the Construction of Emergency Network Public Opinion Emotional Dictionary Based on Emotional Feature Extraction Algorithm.

The Construction and Trend of Feminist Literature Theory Based on Social Media Data Mining

Identification and Classification of Depressed Mental State for End-User over Social Media.

End-to-End Lip-Reading Open Cloud-Based Speech Architecture.

Framing the Cacerolazo: An Analysis of a Social Protest in Ecuador

Analysis of the causes of inferiority feelings based on social media data with Word2Vec

A New Method for WebShell Detection Based on Bidirectional GRU and Attention Mechanism

Detection of hyperpartisan news articles using natural language processing technique

Wiki sense bag creation using multilingual word sense disambiguation
