Graph-based Technique Research Articles

The human microbiome, consisting of diverse bacte-rial, fungal, protozoan and viral species, exerts a profound influence on various physiological processes and disease susceptibility. However, the complexity of microbiome data has presented significant challenges in the analysis and interpretation of these intricate datasets, leading to the development of specialized software that employs machine learning algorithms for these aims. In this paper, we analyze raw data taken from 16S rRNA gene sequencing from three studies, including stool samples from healthy control, patients with adenoma, and patients with colorectal cancer. Firstly, we use network-based methods to reduce dimensions of the dataset and consider only the most important features. In addition, we employ supervised machine learning algorithms to make prediction. Results show that graph-based techniques reduces dimen-sion from 255 up to 78 features with modularity score 0.73 based on different centrality measures. On the other hand, projection methods (non-negative matrix factorization and principal component analysis) reduce dimensions to 7 features. Furthermore, we apply supervised machine learning algorithms on the most important features obtained from centrality measures and on the ones obtained from projection methods, founding that the evaluation metrics have approximately the same scores when applying the algorithms on the entire dataset, on 78 feature and on 7 features. This study demonstrates the efficacy of graph-based and projection methods in the interpretation for 16S rRNA gene sequencing data. Supervised machine learning on refined features from both approaches yields comparable predictive performance, emphasizing specific microbial features-bacteroides, prevotella, fusobacterium, lysinibacillus, blautia, sphingomonas, and faecalibacterium-as key in predicting patient conditions from raw data.

Read full abstract

The article’s subject matter is the processing of abdominal EMG recordings and finding breathing patterns. The goal is to automatically classify respiratory patterns into two classes, or clusters, by two breathing patterns, regular and irregular, using machine learning (ML) methods. The object of the study was to obtain a dataset of 40 randomly picked abdominal EMG recordings (sampling rate equal to 200 Hz) borrowed from the complete dataset published by the Computational Clinical Neurophysiology Laboratory and the Clinical Data Animation Laboratory of Massachusetts General Hospital. The tasks to be solved are as follows: finding ETS (errors-trend-seasonality) model for the EMG series using the exponential smoothing method; obtaining denoised and detrended signals; obtaining the Hurst exponents for EMGs using the power-law decaying of correlograms for the denoised and detrended signals; describing the variabilities, SNR, the outlier fractions, and Hurst exponents by robust statistics, performing correlation analysis, and Principal Components Analysis (PCA); analyzing the structure of the distant matrix by a graph-based technique; obtaining the periodograms in the frequency domain using the known Wiener-Khinchin theorem; and finding the best models and methods of classification and clusterization and evaluating them within modern Machine Learning methods. The methods used are exponential smoothing, the Wiener-Khinchin theorem, the graph theory method, principal component analysis, programing within MAPLE 2020, and data processing by Weka. The authors obtained the following results: 1) wide data variability has been rated with the median absolute deviations, which is the most robust statistic in this case; 2) most of the signals (38 of 40) showed frequent outliers: from a few percent up to 24.6 % of emissions; 3) these four variables: outliers' percentage, variability, SNR, and persistency factors – form the attributes of input vectors of the subjects for further Machine Learning with Weka software; 4) Manhattan distances matrix among subjects' vectors in 4D attributes space allows imaging the data set as a weighted graph, the vertices of which are subjects; 5) the weights of the graph's edges reflect distances between any pair of them. "Closeness centralities" of vertices allowed us to cluster the data set on two clusters with 11 and 29 subjects, and Weka clustering algorithms confirmed this result. 6) The learning curve shows that a sufficiently small data set (from 25 subjects) might be suitable for classification purposes. Conclusions. The scientific novelty of the results obtained is as follows: 1) the Error-Trend-Seasonality model was the same for all data sets. Abdominal EMG of sleeping patients had additive errors and undamped trends without any seasonality; 2) the correlograms' decaying according to power law had been set, and Hurst exponents were in the range (of 0.776–0.887). This testifies to "long memory" (high persistence) of abdominal EMGs; 3) the modified Z-scores and robust statistics with the highest breakdown values were used for the EMG parameters because of many outliers; 4) breathing patterns were set using the periodograms in the frequency domain using the Wiener-Khinchin theorem; 5) the new graph-based method was successfully exploited to cluster the dataset. Parallel clustering with Weka algorithms confirmed the graph-based clustering results.

Read full abstract

Graph-based Technique Research Articles

Related Topics

Articles published on Graph-based Technique

Factor Graph-Based Technique for Trajectory Tracking of Target with High Mobility

EnzyACT: A Novel Deep Learning Method to Predict the Impacts of Single and Multiple Mutations on Enzyme Activity.

Review on graph theory-based image segmentation with its methods

LG-GNN: Local and Global Information-aware Graph Neural Network for default detection

Anomaly Detection in Dynamic Graphs: A Comprehensive Survey

Investigating Causes of Disputes Resulting in Litigation in Airport Development Projects in the United States Using Graph-Based Techniques

GHGDroid: Global heterogeneous graph-based android malware detection

Generative Architecture for Data Imputation in Secure Blockchain-enabled Spatiotemporal Data Management

GAMC: An Unsupervised Method for Fake News Detection Using Graph Autoencoder with Masking

A Graph-Based Technique for the Automated Control-Oriented Modeling of District Heating Networks

Attention-Based Learning for Predicting Drug-Drug Interactions in Knowledge Graph Embedding Based on Multisource Fusion Information

A comparative study of supervised and unsupervised machine learning algorithms applied to human microbiome.

Bridging directed acyclic graphs to linear representations in linear genetic programming: a case study of dynamic scheduling

ProSTAGE: Predicting Effects of Mutations on Protein Stability by Using Protein Embeddings and Graph Convolutional Networks.

Exploring fMRI RDMs: enhancing model robustness through neurobiological data

Application of Artificial Intelligence Techniques to Detect Fake News: A Review

A Graph-Based Technique for Securing the Distributed Cyber-Physical System Infrastructure

Abdominal electromyograms mining: breathing patterns of asleep adults

Sludge bulking monitoring in industrial wastewater treatment plants through graphical methods: A dynamic graph embedding and Bayesian networks approach

Marathi Text Summarization using Extractive Technique

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Graph-based Technique Research Articles

Related Topics

Articles published on Graph-based Technique

Factor Graph-Based Technique for Trajectory Tracking of Target with High Mobility

EnzyACT: A Novel Deep Learning Method to Predict the Impacts of Single and Multiple Mutations on Enzyme Activity.

Review on graph theory-based image segmentation with its methods

LG-GNN: Local and Global Information-aware Graph Neural Network for default detection

Anomaly Detection in Dynamic Graphs: A Comprehensive Survey

Investigating Causes of Disputes Resulting in Litigation in Airport Development Projects in the United States Using Graph-Based Techniques

GHGDroid: Global heterogeneous graph-based android malware detection

Generative Architecture for Data Imputation in Secure Blockchain-enabled Spatiotemporal Data Management

GAMC: An Unsupervised Method for Fake News Detection Using Graph Autoencoder with Masking

A Graph-Based Technique for the Automated Control-Oriented Modeling of District Heating Networks

Attention-Based Learning for Predicting Drug-Drug Interactions in Knowledge Graph Embedding Based on Multisource Fusion Information

A comparative study of supervised and unsupervised machine learning algorithms applied to human microbiome.

Bridging directed acyclic graphs to linear representations in linear genetic programming: a case study of dynamic scheduling

ProSTAGE: Predicting Effects of Mutations on Protein Stability by Using Protein Embeddings and Graph Convolutional Networks.

Exploring fMRI RDMs: enhancing model robustness through neurobiological data

Application of Artificial Intelligence Techniques to Detect Fake News: A Review

A Graph-Based Technique for Securing the Distributed Cyber-Physical System Infrastructure

Abdominal electromyograms mining: breathing patterns of asleep adults

Sludge bulking monitoring in industrial wastewater treatment plants through graphical methods: A dynamic graph embedding and Bayesian networks approach

Marathi Text Summarization using Extractive Technique