Algorithm Affinity Propagation Research Articles

BackgroundClassification and naming is a key step in the analysis, understanding and adequate management of living organisms. However, where to set limits between groups can be puzzling especially in clonal organisms. Within the Mycobacterium tuberculosis complex (MTC), the etiological agent of tuberculosis (TB), experts have first identified several groups according to their pattern at repetitive sequences, especially at the CRISPR locus (spoligotyping), and to their epidemiological relevance. Most groups such as "Beijing" found good support when tested with other loci. However, other groups such as T family and T1 subfamily (belonging to the "Euro-American" lineage) correspond to non-monophyletic groups and still need to be refined. Here, we propose to use a method called Affinity Propagation that has been successfully used in image categorization to identify relevant patterns at the CRISPR locus in MTC.ResultsTo adequately infer the relative divergence time between strains, we used a distance method inspired by the recent evolutionary model by Reyes et al. We first confirm that this method performs better than the Jaccard index commonly used to compare spoligotype patterns. Second, we document the support of each spoligotype family among the previous classification using affinity propagation on the international spoligotyping database SpolDB4. This allowed us to propose a consensus assignation for all SpolDB4 spoligotypes. Third, we propose new signatures to subclassify the T family.ConclusionAltogether, this study shows how the new clustering algorithm Affinity Propagation can help building or refining clonal organims classifications. It also describes well-supported families and subfamilies among M. tuberculosis complex, especially inside the modern "Euro-American" lineage.

Read full abstract

As we enter the year of 2011, the 2009 H1N1 pandemic influenza virus is in the news again. At least 20 people have died of this virus in China since the beginning of 2011 and it is now the predominant flu strain in the country. Although this novel virus was quite stable during its run in the flu season of 2009-2010, a genetic variant of this virus was found in Singapore in early 2010, and then in Australia and New Zealand during their 2010 winter influenza season. Several critical mutations in the HA protein of this variant were uncovered in the strains collected from January 2010 to April 2010. Moreover, a structural homology model of HA from the A/Brisbane/10/2010(H1N1) strain was made based on the structure of A/California/04/2009 (H1N1). The purpose of this study was to investigate mutations in the HA protein of 2009 H1N1 from sequence data collected worldwide from May 2010 to February 2011. A fundamental problem in bioinformatics and biology is to find the similar gene sequences for a given gene sequence of interest. Here we proposed the inverse problem, i.e., finding the exemplars from a group of related gene sequences. With a clustering algorithm affinity propagation, six exemplars of the HA sequences were identified to represent six clusters. One of the clusters contained strain A/Brisbane/12/2010(H1N1) that only differed from A/Brisbane/10/2010 in the HA sequence at position 449. Based on the sequence identity of the six exemplars, nine mutations in HA were located that could be used to distinguish these six clusters. Finally, we discovered the change of correlation patterns for the HA and NA of 2009 H1N1 as a result of the HA receptor binding specificity switch, revealing the balanced interplay between these two surface proteins of the virus.

Read full abstract

Algorithm Affinity Propagation Research Articles

Related Topics

Articles published on Algorithm Affinity Propagation

Network Pruning Using Adaptive Exemplar Filters.

New Binding Mode of SLURP Protein to a7 Nicotinic Acetylcholine Receptor Revealed by Computer Simulations

BinSanity: unsupervised clustering of environmental microbial assemblies using coverage and affinity propagation.

Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis

Text Clustering with Seeds Affinity Propagation

New mutational trends in the HA protein of 2009 H1N1 pandemic influenza virus from May 2010 to February 2011

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Algorithm Affinity Propagation Research Articles

Related Topics

Articles published on Algorithm Affinity Propagation

Network Pruning Using Adaptive Exemplar Filters.

New Binding Mode of SLURP Protein to a7 Nicotinic Acetylcholine Receptor Revealed by Computer Simulations

BinSanity: unsupervised clustering of environmental microbial assemblies using coverage and affinity propagation.

Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis

Text Clustering with Seeds Affinity Propagation

New mutational trends in the HA protein of 2009 H1N1 pandemic influenza virus from May 2010 to February 2011