Semi-supervised Learning Approach Research Articles

The exploration of important biomarkers associated with cancer development is crucial for diagnosing cancer, designing therapeutic interventions, and predicting prognoses. The analysis of gene co-expression provides a systemic perspective on gene networks and can be a valuable tool for mining biomarkers. The main objective of co-expression network analysis is to discover highly synergistic sets of genes, and the most widely used method is weighted gene co-expression network analysis (WGCNA). WGCNA measures gene correlation with the Pearson correlation coefficient and uses hierarchical clustering to identify gene modules. However, the Pearson correlation coefficient reflects only the linear dependence between variables, and the main drawback of hierarchical clustering is that once two objects are clustered together, the process cannot be reversed, so inappropriate cluster divisions cannot be readjusted. In addition, existing co-expression network analysis methods rely on unsupervised methods that do not use prior biological knowledge for module delineation. Here we present a method for identifying outstanding modules in a co-expression network using a knowledge-injected semi-supervised learning approach (KISL), which uses a priori biological knowledge and a semi-supervised clustering method to address these issues in current gene co-expression network (GCN)-based clustering methods. Because gene-gene relationships are complex, we use distance correlation to measure both the linear and non-linear dependence between genes. Eight RNA-seq datasets of cancer samples are used to validate the method's effectiveness. In all eight datasets, KISL outperformed WGCNA on the silhouette coefficient, Calinski-Harabasz index, and Davies-Bouldin index evaluation metrics. According to the results, KISL clusters had better cluster evaluation values and better gene module aggregation. Enrichment analysis of the identified modules demonstrated the method's effectiveness in discovering modular structures in biological co-expression networks. In addition, as a general method, KISL can be applied to various co-expression network analyses based on similarity metrics. Source code for KISL and the related scripts is available online at https://github.com/Mowonhoo/KISL.git.
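The contrast the abstract draws between Pearson correlation and distance correlation can be illustrated with a small sketch. This is not the KISL implementation; it is a minimal pure-Python version of the standard sample distance correlation, and the function names and the toy expression vectors are assumptions made for illustration only.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def _double_centered(values):
    """Pairwise |a - b| distances with row, column and grand means removed."""
    n = len(values)
    dist = [[abs(a - b) for b in values] for a in values]
    row_mean = [sum(row) / n for row in dist]  # equals column means (symmetric matrix)
    grand = sum(row_mean) / n
    return [
        [dist[i][j] - row_mean[i] - row_mean[j] + grand for j in range(n)]
        for i in range(n)
    ]


def distance_correlation(x, y):
    """Sample distance correlation of two 1-D sequences, a value in [0, 1]."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    a, b = _double_centered(x), _double_centered(y)

    def mean_product(p, q):
        return sum(p[i][j] * q[i][j] for i in range(n) for j in range(n)) / (n * n)

    dcov2 = mean_product(a, b)                       # squared distance covariance
    dvar2 = mean_product(a, a) * mean_product(b, b)  # product of squared distance variances
    if dvar2 == 0:
        return 0.0  # at least one input is constant
    return math.sqrt(max(dcov2, 0.0) / math.sqrt(dvar2))


# Hypothetical toy "expression profiles": gene_b depends on gene_a quadratically.
gene_a = [-2.0, -1.0, 0.0, 1.0, 2.0]
gene_b = [v * v for v in gene_a]

print("Pearson:", pearson(gene_a, gene_b))
print("Distance correlation:", distance_correlation(gene_a, gene_b))
```

On this toy pair the Pearson coefficient is zero because the relationship is symmetric and purely non-linear, while the distance correlation is clearly positive (roughly 0.5), which is the kind of dependence a Pearson-based network would miss.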

Literature on supervised machine-learning (ML) approaches for classifying text-based safety reports in the construction sector has been growing. Recent studies have emphasized the need to build ML approaches that balance high classification accuracy with performance on management criteria, such as resource intensiveness. However, despite being highly accurate, the extensively studied supervised ML approaches may not perform well on management criteria, as many factors contribute to their resource intensiveness. The potential for semi-supervised ML approaches to achieve balanced performance, by contrast, has rarely been explored in the construction safety literature. The current study contributes to the scarce knowledge on semi-supervised ML approaches by demonstrating the applicability of a state-of-the-art semi-supervised learning approach, namely Yet Another Keyword Extractor (YAKE) integrated with Guided Latent Dirichlet Allocation (GLDA), for construction safety report classification. Construction-safety-specific knowledge is extracted as keywords through YAKE, relying on accessible literature with minimal manual intervention. Keywords from YAKE are then seeded in the GLDA model for the automatic classification of safety reports without requiring a large quantity of prelabeled data. The YAKE-GLDA classification performance (F1 score of 0.66) is superior to existing unsupervised methods on the benchmark data containing injury narratives from the Occupational Safety and Health Administration (OSHA). The YAKE-GLDA approach is also applied to near-miss safety reports from a construction site. The study demonstrates a high degree of generality of the YAKE-GLDA approach through a moderately high F1 score of 0.86 for a few categories in the near-miss data. The current research demonstrates that, unlike the existing supervised approaches, the semi-supervised YAKE-GLDA approach can consistently achieve reasonably good classification performance across various construction-specific safety datasets while remaining resource-efficient. Results from an objective comparative analysis and a sensitivity analysis contribute much-needed, knowledge-contesting insights into the functioning and applicability of YAKE-GLDA. The results from the current study will help construction organizations implement and optimize an efficient ML-based knowledge-mining strategy for domains beyond safety and across sites where the availability of a pre-labeled dataset is a significant limitation.
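The abstract reports F1 scores both overall and for individual categories. As a reminder of how a per-category F1 score is computed, here is a minimal sketch. The category names and labels below are invented for illustration and are not taken from the OSHA or near-miss data, and the helper name is an assumption.

```python
def f1_per_category(true_labels, predicted_labels):
    """Return {category: F1} where F1 = 2*TP / (2*TP + FP + FN) per category."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label sequences must have the same length")
    pairs = list(zip(true_labels, predicted_labels))
    scores = {}
    for category in sorted(set(true_labels) | set(predicted_labels)):
        tp = sum(1 for t, p in pairs if t == category and p == category)
        fp = sum(1 for t, p in pairs if t != category and p == category)
        fn = sum(1 for t, p in pairs if t == category and p != category)
        denominator = 2 * tp + fp + fn
        # A category with no true or predicted instances gets 0.0 by convention here.
        scores[category] = 2 * tp / denominator if denominator else 0.0
    return scores


# Hypothetical report categories and classifier output (illustrative only).
true_labels = ["fall", "fall", "fall", "struck_by", "struck_by", "electrical"]
predicted_labels = ["fall", "fall", "struck_by", "struck_by", "struck_by", "fall"]

scores = f1_per_category(true_labels, predicted_labels)
for category, score in scores.items():
    print(f"{category}: F1 = {score:.2f}")
```

Reporting F1 per category, as in this sketch, makes it visible when a method does well on a few categories and poorly on others, which a single averaged score would hide.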

Articles published on Semi-supervised Learning Approach

Leveraging Structure for Improved Classification of Grouped Biased Data

T5-Based Model for Abstractive Summarization: A Semi-Supervised Learning Approach with Consistency Loss Functions

Robust Detection, Segmentation, and Metrology of High Bandwidth Memory 3D Scans Using an Improved Semi-Supervised Deep Learning Approach

SSMD-UNet: semi-supervised multi-task decoders network for diabetic retinopathy segmentation

Advancements in AI for Brain Tumor Detection and Classification

Gaussian transformation enhanced semi-supervised learning for sleep stage classification

MIM-Graph: A multi-sensor network approach for fault diagnosis of HSR Bogie bearings at the IoT edge via mutual information maximization

KISL: knowledge-injected semi-supervised learning for biological co-expression network modules.

MsDroid: Identifying Malicious Snippets for Android Malware Detection

Simultaneous multi-crop land suitability prediction from remote sensing data using semi-supervised learning

Identifying Fraudsters and Fraudulent Strategies in Mobile Social Network

Adaptive inverse mapping: a model-free semi-supervised learning approach towards robust imaging through dynamic scattering media.

SpaDecon: cell-type deconvolution in spatial transcriptomics with semi-supervised learning

Semisupervised Learning for Noise Suppression Using Deep Reinforcement Learning of Contrastive Features

SEBD: A Stream Evolving Bot Detection Framework with Application of PAC Learning Approach to Maintain Accuracy and Confidence Levels

Classification of operational states in porous journal bearings using a semi-supervised multi-sensor Machine Learning approach

Anomaly Detection for Sensor Signals Utilizing Deep Learning Autoencoder-Based Neural Networks.

A multi-task and multi-channel convolutional neural network for semi-supervised neonatal artefact detection

Deep reinforcement learning for data-efficient weakly supervised business process anomaly detection

Automatic classification of construction safety reports using semi-supervised YAKE-Guided LDA approach
