Semantic Uncertainty Research Articles

Background: Text mining derives information and patterns from textual data. Online social media platforms, which have recently acquired great interest, generate vast text data about human behaviors based on their interactions. This data is generally ambiguous and unstructured. The data includes typing errors and errors in grammar that cause lexical, syntactic, and semantic uncertainties. This results in incorrect pattern detection and analysis. Researchers are employing various text mining techniques that can aid in Topic Modeling, the detection of Trending Topics, the identification of Hate Speeches, and the growth of communities in online social media networks. Objective: This review paper compares the performance of ten machine learning classification techniques on a Twitter data set for analyzing users' sentiments on posts related to airline usage. Methods: Review and comparative analysis of Gaussian Naive Bayes, Random Forest, Multinomial Naive Bayes, Multinomial Naive Bayes with Bagging, Adaptive Boosting (AdaBoost), Optimized AdaBoost, Support Vector Machine (SVM), Optimized SVM, Logistic Regression, and Long-Short Term Memory (LSTM) for sentiment analysis. Results: The results of the experimental study showed that the Optimized SVM performed better than the other classifiers, with a training accuracy of 99.73% and testing accuracy of 89.74% compared to other models. Conclusion: Optimized SVM uses the RBF kernel function and nonlinear hyperplanes to split the dataset into classes, correctly classifying the dataset into distinct polarity. This, together with Feature Engineering utilizing Forward Trigrams and Weighted TF-IDF, has improved Optimized SVM classifier performance regarding train and test accuracy. Therefore, the train and test accuracy of Optimized SVM are 99.73% and 89.74% respectively. When compared to Random Forest, a marginal of 0.09% and 1.73% performance enhancement is observed in terms of train and test accuracy and 1.29% (train accuracy) and 3.63% (test accuracy) of improved performance when compared with LSTM. Likewise, Optimized SVM, gave more than 10% of enhanced performance in terms of train accuracy when compared with Gaussian Naïve Bayes, Multinomial Naïve Bayes, Multinomial Naïve Bayes with Bagging, Logistic Regression and a similar enhancement is observed with AdaBoost and Optimized AdaBoost which are ensemble models during the experimental process. Optimized SVM also has outperformed all the classification models in terms of AUC-ROC train and test scores.

Read full abstract

Accurate mapping of global urban man-made objects such as buildings and roads is critical for monitoring urbanization. Remote sensing imagery provides a cost-effective way of mapping these objects, but the challenge of “knowledge forgetting” arises due to urban diversity and the continuous growth of global samples. Although the existing knowledge distillation approaches can transfer knowledge from a larger teacher model to a smaller student model by distilling the knowledge learned from reliable labels, they fail to work for global-scale mapping, which lies in two aspects: low-quality labeling and fixed-size models. In this paper, we propose GUMONet, which is a semi-supervised knowledge distillation framework for global-scale urban man-made object mapping. For the first phase, a label diversity progressive learning module is introduced for generating high-quality labels in a semi-supervised manner. Label diversity is used to measure the diverse urban patterns based on spatial-semantic uncertainty, where the diversified labels clustered in object boundaries and heterogeneous areas are attributed to high spatial uncertainty and semantic uncertainty, respectively. Based on the label diversity, the model decision boundary is progressively determined from coarse to fine. Specifically, at the early stage, instances away from the decision boundary are selected to ensure the stability of the model training. As the iteration progresses, instances close to the decision boundary are associated with a higher probability of further enhancing the quality of the uncertain labels by hard sample mining. For the second phase, a size-variable knowledge distillation module is adopted to optimize the data-model matching process. This module consists of a noise teacher model that prevents overfitting by injecting noise perturbations to increase the data distribution complexity and a size-variable student model that avoids underfitting by dynamically adjusting its size with the growth of global samples. We applied GUMONet to six study areas across four continents, with data from different sensors, achieving an 18.97% improvement in intersection over union, compared with the previous methods. Our results also demonstrate a positive correlation between urban development and urban diversity, with a correlation coefficient of 0.749. As urban development progresses, urban diversity stabilizes and building transformation becomes the primary means of promoting further development.

Read full abstract

Semantic Uncertainty Research Articles

Related Topics

Articles published on Semantic Uncertainty

Semantic Segmentation Uncertainty Assessment of Different U-net Architectures for Extracting Building Footprints

Economical hybrid novelty detection leveraging global aleatoric semantic uncertainty for enhanced MRI-based ACL tear diagnosis

An end-to-end image-text matching approach considering semantic uncertainty

Progressive Decision Boundary Shifting for Unsupervised Domain Adaptation.

Methods of accounting for measurement uncertainty and semantic uncertainty in the classification of the state of objects according to quality indicators

Semantic Minimalism’s Truth-Evaluability: The Minimal Proposition and Semantic Uncertainty

Teacher’s Agency as an Indicator of Leadership Potential Development

Discourse factor of semantic uncertainty in reverse machine translation of media texts

Text Mining - A Comparative Review of Twitter Sentiments Analysis

Methods of judicial argumentation in case of semantic uncertainty of a legal text

Geometric Matching for Cross-Modal Retrieval.

NARRATIVE AS A MEANS OF SOLVING MEANINGFUL PROBLEMS IN A SITUATION OF UNCERTAINTY

Semantic uncertainty of the general theory of systems and problems of its interpretation and formalization

Semi-supervised knowledge distillation framework for global-scale urban man-made object remote sensing mapping

МЕХАНИЗМЫ ФОРМИРОВАНИЯ СОВРЕМЕННОЙ ДЕСТРУКТИВНОЙ РЕАЛЬНОСТИ: СОЦИАЛЬНО-КОГНИТИВНЫЙ АСПЕКТ

The paradox of tradition: Alexander Spendiaryan, Lev Revutsky, Igor Stravinsky

The effect of speech degradation on the ability to track and predict turn structure in conversation

ЧТО ОЗНАЧАЕТ "ПОТЕРЯТЬ ЛИЦО" В РУССКОМ ЯЗЫКЕ?

How Many Types of Justice are in Thomas Hobbes’ Leviathan?

Problems of semantic reconstruction of the sociological narrative

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Semantic Uncertainty Research Articles

Related Topics

Articles published on Semantic Uncertainty

Semantic Segmentation Uncertainty Assessment of Different U-net Architectures for Extracting Building Footprints

Economical hybrid novelty detection leveraging global aleatoric semantic uncertainty for enhanced MRI-based ACL tear diagnosis

An end-to-end image-text matching approach considering semantic uncertainty

Progressive Decision Boundary Shifting for Unsupervised Domain Adaptation.

Methods of accounting for measurement uncertainty and semantic uncertainty in the classification of the state of objects according to quality indicators

Semantic Minimalism’s Truth-Evaluability: The Minimal Proposition and Semantic Uncertainty

Teacher’s Agency as an Indicator of Leadership Potential Development

Discourse factor of semantic uncertainty in reverse machine translation of media texts

Text Mining - A Comparative Review of Twitter Sentiments Analysis

Methods of judicial argumentation in case of semantic uncertainty of a legal text

Geometric Matching for Cross-Modal Retrieval.

NARRATIVE AS A MEANS OF SOLVING MEANINGFUL PROBLEMS IN A SITUATION OF UNCERTAINTY

Semantic uncertainty of the general theory of systems and problems of its interpretation and formalization

Semi-supervised knowledge distillation framework for global-scale urban man-made object remote sensing mapping

МЕХАНИЗМЫ ФОРМИРОВАНИЯ СОВРЕМЕННОЙ ДЕСТРУКТИВНОЙ РЕАЛЬНОСТИ: СОЦИАЛЬНО-КОГНИТИВНЫЙ АСПЕКТ

The paradox of tradition: Alexander Spendiaryan, Lev Revutsky, Igor Stravinsky

The effect of speech degradation on the ability to track and predict turn structure in conversation

ЧТО ОЗНАЧАЕТ "ПОТЕРЯТЬ ЛИЦО" В РУССКОМ ЯЗЫКЕ?

How Many Types of Justice are in Thomas Hobbes’ Leviathan?

Problems of semantic reconstruction of the sociological narrative