Style Algorithm Research Articles

Periodicity is common in social interactions in temporal networks, and mining periodic communities is essential to understanding periodic group behavior in such networks. Unfortunately, most previous studies of community mining in temporal networks ignore the periodic patterns of communities. In this paper, we study the problem of finding periodic communities in a temporal network, where each edge is associated with a set of timestamps. We propose novel models, including the $\sigma$-periodic $k$-core and the $\sigma$-periodic $k$-clique, that represent periodic communities in temporal networks. Specifically, a $\sigma$-periodic $k$-core (or $\sigma$-periodic $k$-clique) is a $k$-core (or a clique of size larger than $k$) that appears at least $\sigma$ times periodically in the temporal graph. Searching for periodic cores is efficient, but the resulting communities may not be sufficiently cohesive; enumerating all periodic cliques is NP-hard, but the resulting communities are very cohesive. To compute both efficiently, we first develop two effective graph reduction techniques that significantly prune the temporal graph. Then, we transform the temporal graph into a static graph and prove that mining the periodic communities in the temporal graph is equivalent to mining communities in the transformed graph. Subsequently, we propose a decomposition algorithm to search for the maximal $\sigma$-periodic $k$-core, a Bron-Kerbosch-style algorithm to enumerate all maximal $\sigma$-periodic $k$-cliques, and a branch-and-bound-style algorithm to find the maximum $\sigma$-periodic clique. The results of extensive experiments on five real-life datasets demonstrate the efficiency, scalability, and effectiveness of our algorithms.
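To make the model more concrete, here is a minimal Python sketch of one way an edge-level periodicity check could be combined with ordinary k-core peeling. It is not the authors' reduction technique or decomposition algorithm. It assumes that "appears periodically" means the timestamps contain an arithmetic progression of length sigma, which the abstract does not spell out, and the function names and input layout (a dict from vertex pairs to timestamp lists) are hypothetical. Keeping only edges that pass the check is a necessary condition for membership in a periodic core, not a sufficient one, since the full model would also require the whole subgraph to recur at the same timestamps.

```python
from collections import defaultdict


def has_periodic_support(timestamps, sigma):
    """True if the timestamps contain an arithmetic progression of length sigma.

    Assumption: this is what "appears sigma times periodically" means.
    """
    ts = sorted(set(timestamps))
    present = set(ts)
    if sigma <= 1:
        return len(ts) >= sigma
    for i, start in enumerate(ts):
        for nxt in ts[i + 1:]:
            period = nxt - start
            # Check start, start + period, ..., start + (sigma - 1) * period.
            if all(start + j * period in present for j in range(sigma)):
                return True
    return False


def k_core(adjacency, k):
    """Repeatedly remove vertices of degree < k; return the surviving vertices."""
    adj = {v: set(ns) for v, ns in adjacency.items()}
    queue = [v for v, ns in adj.items() if len(ns) < k]
    while queue:
        v = queue.pop()
        if v not in adj:
            continue  # already removed
        for u in adj.pop(v):
            if u in adj:
                adj[u].discard(v)
                if len(adj[u]) < k:
                    queue.append(u)
    return set(adj)


def prune_then_core(temporal_edges, sigma, k):
    """Drop edges without periodic support, then peel to a k-core (hypothetical helper)."""
    adjacency = defaultdict(set)
    for (u, v), stamps in temporal_edges.items():
        if has_periodic_support(stamps, sigma):
            adjacency[u].add(v)
            adjacency[v].add(u)
    return k_core(adjacency, k)


if __name__ == "__main__":
    edges = {
        ("a", "b"): [1, 3, 5],
        ("b", "c"): [1, 3, 5, 6],
        ("a", "c"): [1, 3, 5],
        ("c", "d"): [2, 3, 7],  # no arithmetic progression of length 3
    }
    print(prune_then_core(edges, sigma=3, k=2))
```

In this toy input the triangle a, b, c survives for sigma = 3 and k = 2, while the edge to d is dropped because its timestamps never recur at a fixed interval three times.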

Streptococcus pneumoniae typically expresses one of 92 serologically distinct capsule polysaccharide (cps) types (serotypes). Some of these serotypes are closely related to each other; using the commercially available typing antisera, these are assigned to common serogroups containing types that show cross-reactivity. In this serotyping scheme, factor antisera are used to allocate serotypes within a serogroup, based on patterns of reactions. This serotyping method is technically demanding and requires considerable experience, and the reading of the results can be subjective. This study describes the analysis of the S. pneumoniae capsular operon genetic sequence to determine serotype-distinguishing features and the development, evaluation and verification of an automated whole genome sequence (WGS)-based serotyping bioinformatics tool, PneumoCaT (Pneumococcal Capsule Typing). Initially, WGS data from 871 S. pneumoniae isolates were mapped to reference cps locus sequences for the 92 serotypes. Thirty-two of 92 serotypes could be unambiguously identified based on sequence similarities within the cps operon. The remaining 60 were allocated to one of 20 ‘genogroups’ that broadly correspond to the immunologically defined serogroups. By comparing the cps reference sequences for each genogroup, unique molecular differences were determined for serotypes within 18 of the 20 genogroups and verified using the set of 871 isolates. This information was used to design a decision-tree style algorithm within the PneumoCaT bioinformatics tool to predict capsular type to serotype level for 89/94 types (92 + 2 molecular types/subtypes) from WGS data and to serogroup level for serogroups 24 and 32, which currently comprise 2.1% of UK referred, invasive isolates submitted to the National Reference Laboratory (NRL), Public Health England (June 2014–July 2015). PneumoCaT was evaluated with an internal validation set of 2065 UK isolates covering 72/92 serotypes, including 19 non-typeable isolates, and an external validation set of 2964 isolates from Thailand (n = 2,531), USA (n = 181) and Iceland (n = 252). PneumoCaT was able to predict serotype in 99.1% of the typeable UK isolates and in 99.0% of the non-UK isolates. Concordance was evaluated in UK isolates where further investigation was possible; in 91.5% of the cases the predicted capsular type was concordant with the serologically derived serotype. Following retesting, concordance increased to 99.3%, and in most resolved cases (97.8%; 135/138) discordance was shown to be caused by errors in the original serotyping. Replicate testing demonstrated that PneumoCaT gave 100% reproducibility of the predicted serotype result. In summary, we have developed a WGS-based serotyping method that can predict capsular type to serotype level for 89/94 serotypes and to serogroup level for the remaining four. This approach could be integrated into routine typing workflows in reference laboratories, reducing the need for phenotypic immunological testing.

Articles published on Style Algorithm

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm

Reinforcement learning with dynamic convex risk measures

Co-Learning Bayesian Optimization.

A k-means method for trends of time series: An application to time series of COVID-19 cases in Japan.

Dose calculation and reporting with a linear Boltzman transport equation solver in vertebral SABR.

Data driven orthogonal basis selection for functional data analysis

A novel sub-Kmeans based on co-training approach by transforming single-view into multi-view

Periodic Communities Mining in Temporal Networks: Concepts and Algorithms

Data Anomaly Detection for Internet of Vehicles Based on Traffic Cellular Automata and Driving Style.

Improving parallel efficiency for asynchronous graph analytics using Gauss‐Seidel‐based matrix computation

Improved Online Sequential Extreme Learning Machine: A New Intelligent Evaluation Method for AZ-Style Algorithms

A Sparse Manifold Learning Approach to Robust Indoor Positioning Based on Wi-Fi RSS Fingerprinting

A polynomial-time algorithm to compute generalized Hermite normal forms of matrices over [formula omitted

Multi-view clustering: A survey

Dual Set Multi-Label Learning

P.3.b.043 - Two-year follow-up of patients with a first psychotic episode: comparison between affective and non-affective psychoses and predictors of functioning

Software Physiognomics: Adorno's Radio Analytics Today

Dense registration of fingerprints

Whole genome sequencing of Streptococcus pneumoniae: development, evaluation and verification of targets for serogroup and serotype prediction using an automated pipeline.

Improving semi-supervised self-training with embedded manifold transduction
