Bag-of-words Research Articles

With the introduction of Industry 4.0 into our lives and the creation of smart factories, predictive maintenance has become even more important. Predictive maintenance systems are often used in the manufacturing industry. On the other hand, text analysis and Natural Language Processing (NLP) techniques are gaining a lot of attention by both research and industry due to their ability to combine natural languages and industrial solutions. There is a great increase in the number of studies on NLP in the literature. Even though there are studies in the field of NLP in predictive maintenance systems, no studies were found on Turkish NLP for predictive maintenance. This study focuses on the similarity analysis of failure texts that can be used in the predictive maintenance system we developed for VESTEL, one of the leading consumer electronics manufacturers in Turkey. In the manufacturing industry, operators record descriptions of failure that occur on production lines as short texts. However, these descriptions are not often used in predictive maintenance work. In this study, semantic text similarities between fault definitions in the production line were compared using traditional word representations, modern word representations and Transformer models. Levenshtein, Jaccard, Pearson, and Cosine scales were used as similarity measures and the effectiveness of these measures were compared. Experimental data including failure texts were obtained from a consumer electronics manufacturer in Turkey. When the experimental results are examined, it is seen that the Jaccard similarity metric is not successful in grouping semantic similarities according to the other three similarity measures. In addition, Multilingual Universal Sentence Encoder (MUSE), Language-agnostic BERT Sentence Embedding (LAbSE), Bag of Words (BoW) and Term Frequency - Inverse Document Frequency (TF-IDF) outperform FastText and Language-Agnostic Sentence Representations (LASER) models in semantic discovery of error identification in embedding methods. Briefly to conclude, Pearson and Cosine are more effective at finding similar failure texts; MUSE, LAbSE, BoW and TF-IDF methods are more successful at representing the failure text.

Read full abstract

With the increasing usage of drugs to remedy different diseases, drug safety has become crucial over the past few years. Often medicine from several companies is offered for a single disease that involves the same/similar substances with slightly different formulae. Such diversification is both helpful and dangerous as such medicine proves to be more effective or shows side effects to different patients. Despite clinical trials, side effects are reported when the medicine is used by the mass public, of which several such experiences are shared on social media platforms. A system capable of analyzing such reviews could be very helpful to assist healthcare professionals and companies for evaluating the safety of drugs after it has been marketed. Sentiment analysis of drug reviews has a large potential for providing valuable insights into these cases. Therefore, this study proposes an approach to perform analysis on the drug safety reviews using lexicon-based and deep learning techniques. A dataset acquired from the ‘Drugs.Com’ containing reviews of drug-related side effects and reactions, is used for experiments. A lexicon-based approach, Textblob is used to extract the positive, negative or neutral sentiment from the review text. Review classification is achieved using a novel hybrid deep learning model of convolutional neural networks and long short-term memory (CNN-LSTM) network. The CNN is used at the first level to extract the appropriate features while LSTM is used at the second level. Several well-known machine learning models including logistic regression, random forest, decision tree, and AdaBoost are evaluated using term frequency-inverse document frequency (TF-IDF), a bag of words (BoW), feature union of (TF-IDF + BoW), and lexicon-based methods. Performance analysis with machine learning models, long short term memory and convolutional neural network models, and state-of-the-art approaches indicate that the proposed CNN-LSTM model shows superior performance with an 0.96 accuracy. We also performed a statistical significance T-test to show the significance of the proposed CNN-LSTM model in comparison with other approaches.

Read full abstract

Bag-of-words Research Articles

Related Topics

Articles published on Bag-of-words

Semantic Similarity Comparison Between Production Line Failures for Predictive Maintenance

A sophisticated semantic analysis framework using an intelligent tweet data clustering and classification methodologies

HFNet-SLAM: An Accurate and Real-Time Monocular SLAM System with Deep Features.

Hybrid classifier model with tuned weights for human activity recognition

Data reduction for X-ray serial crystallography using machine learning.

Automated Identification of Disaster News for Crisis Management using Machine Learning

Content-Based Image Retrieval System using Color Moment and Bag of Visual Words with Local Binary Pattern

Mining culture from professional discourse: a lexicon-based hybrid method

Proposed spatio‐temporal features for human activity classification using ensemble classification model

MBi-GRUMCONV: A novel Multi Bi-GRU and Multi CNN-Based deep learning model for social media sentiment analysis

An Emotion-Based Rating System for Books Using Sentiment Analysis and Machine Learning in the Cloud

Predicting bankruptcy of firms using earnings call data and transfer learning.

Hoax COVID-19 News Detection Based on Sentiment Analysis in Indonesian using Support Vector Machine (SVM) Method

Natural language processing of admission notes to predict postpartum hemorrhage

Drug Usage Safety from Drug Reviews with Hybrid Machine Learning Approach

Two Stage Job Title Identification System for Online Job Advertisements

Stress detection system for social media users

Object Recognition for Humanoid Robots Using Full Hand Tactile Sensor

RadFormer: Transformers with global-local attention for interpretable and accurate Gallbladder Cancer detection.

Workers’ perceptions of mHealth services for physical activity and mental health: A qualitative study using a text-mining method

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Bag-of-words Research Articles

Related Topics

Articles published on Bag-of-words

Semantic Similarity Comparison Between Production Line Failures for Predictive Maintenance

A sophisticated semantic analysis framework using an intelligent tweet data clustering and classification methodologies

HFNet-SLAM: An Accurate and Real-Time Monocular SLAM System with Deep Features.

Hybrid classifier model with tuned weights for human activity recognition

Data reduction for X-ray serial crystallography using machine learning.

Automated Identification of Disaster News for Crisis Management using Machine Learning

Content-Based Image Retrieval System using Color Moment and Bag of Visual Words with Local Binary Pattern

Mining culture from professional discourse: a lexicon-based hybrid method

Proposed spatio‐temporal features for human activity classification using ensemble classification model

MBi-GRUMCONV: A novel Multi Bi-GRU and Multi CNN-Based deep learning model for social media sentiment analysis

An Emotion-Based Rating System for Books Using Sentiment Analysis and Machine Learning in the Cloud

Predicting bankruptcy of firms using earnings call data and transfer learning.

Hoax COVID-19 News Detection Based on Sentiment Analysis in Indonesian using Support Vector Machine (SVM) Method

Natural language processing of admission notes to predict postpartum hemorrhage

Drug Usage Safety from Drug Reviews with Hybrid Machine Learning Approach

Two Stage Job Title Identification System for Online Job Advertisements

Stress detection system for social media users

Object Recognition for Humanoid Robots Using Full Hand Tactile Sensor

RadFormer: Transformers with global-local attention for interpretable and accurate Gallbladder Cancer detection.

Workers’ perceptions of mHealth services for physical activity and mental health: A qualitative study using a text-mining method