Single Nucleotide Polymorphism Patterns Research Articles

Cassava Brown Streak Disease (CBSD), which is caused by cassava brown streak virus (CBSV) and Ugandan cassava brown streak virus (UCBSV), represents one of the most devastating threats to cassava production in Africa, including in Rwanda where a dramatic epidemic in 2014 dropped cassava yield from 3.3 million to 900,000 tonnes (1). Studying viral genetic diversity at the genome level is essential in disease management, as it can provide valuable information on the origin and dynamics of epidemic events. To fill the current lack of genome-based diversity studies of UCBSV, we performed a nationwide survey of cassava ipomovirus genomic sequences in Rwanda by high-throughput sequencing (HTS) of pools of plants sampled from 130 cassava fields in thirteen cassava-producing districts, spanning seven agro-ecological zones with contrasting climatic conditions and different cassava cultivars. HTS allowed the assembly of a nearly complete consensus genome of UCBSV in twelve districts. The phylogenetic analysis revealed high homology between UCBSV genome sequences, with a maximum of 0.8 per cent divergence between genomes at the nucleotide level. An in-depth investigation based on Single Nucleotide Polymorphisms (SNPs) was conducted to explore the genome diversity beyond the consensus sequences. First, to ensure the validity of the result, a panel of SNPs was confirmed by independent reverse transcription polymerase chain reaction (RT-PCR) and Sanger sequencing. Furthermore, the combination of fixation index (FST) calculation and Principal Component Analysis (PCA) based on SNP patterns identified three different UCBSV haplotypes geographically clustered. The haplotype 2 (H2) was restricted to the central regions, where the NAROCAS 1 cultivar is predominantly farmed. RT-PCR and Sanger sequencing of individual NAROCAS1 plants confirmed their association with H2. Haplotype 1 was widely spread, with a 100 per cent occurrence in the Eastern region, while Haplotype 3 was only found in the Western region. These haplotypes' associations with specific cultivars or regions would need further confirmation. Our results prove that a much more complex picture of genetic diversity can be deciphered beyond the consensus sequences, with practical implications on virus epidemiology, evolution, and disease management. Our methodology proposes a high-resolution analysis of genome diversity beyond the consensus between and within samples. It can be used at various scales, from individual plants to pooled samples of virus-infected plants. Our findings also showed how subtle genetic differences could be informative on the potential impact of agricultural practices, as the presence and frequency of a virus haplotype could be correlated with the dissemination and adoption of improved cultivars.

Read full abstract

Technologies identifying single nucleotide polymorphisms (SNPs) in DNA sequencing yield an avalanche of data requiring analysis and interpretation. Standard methods may require many weeks of processing time. The use of statistical methods requiring data sorting, matrix inversions of a high-dimension and replication in subsets of the data on multiple outcomes exacerbate these times.A method which reduces the computational time in problems with time-to-event outcomes and hundreds of thousands/millions of SNPs using Cox-Snell residuals after fitting the Cox proportional hazards model (PH) to a fixed set of concomitant variables is proposed. This yields coefficients for SNP effect from a Cox-Snell adjusted Poisson model and shows a high concordance to the adjusted PH model.The method is illustrated with a sample of 10000 SNPs from a genome-wide association study in a diabetic population. The gain in processing efficiency using the proposed method based on Poisson modelling can be as high as 62%. This could result in saving of over three weeks processing time if 5 million SNPs require analysis. The method involves only a single predictor variable (SNP), offering a simpler, computationally more stable approach to examining and identifying SNP patterns associated with the outcome(s) allowing for a faster development of genetic signatures. Use of deviance residuals from the PH model to screen SNPs demonstrates a large discordance rate at a 0.2% threshold of concordance. This rate is 15 times larger than that based on the Cox-Snell residuals from the Cox-Snell adjusted Poisson model. The method is simple to implement as the procedures are available in most statistical packges. The approach involves obtaining Cox-Snell residuals from a PH model, to a binary time-to-event outcome, for factors which need to be common when assessing each SNP. Each SNP is then fitted as a predictor to the outcome of interest using a Poisson model with the Cox-Snell as the exposure variable.

Read full abstract

Single Nucleotide Polymorphism Patterns Research Articles

Related Topics

Articles published on Single Nucleotide Polymorphism Patterns

Single Nucleotide Polymorphism-based Identification of Bacterial Artificial Chromosome-mediated Homologous Recombination.

TNFα rs1800629 Polymorphism and Response to Anti-TNFα Treatment in Behçet Syndrome: Data from an Italian Cohort Study.

Going beyond consensus genome sequences: An innovative SNP-based methodology reconstructs different Ugandan cassava brown streak virus haplotypes at a nationwide scale inRwanda.

Evaluation of SNP-Based Markers Utilization for Resistance to Fall Armyworm Spodoptera frugiperda on Eight Corn Varieties

Performance of the tetra-primer PCR technique compared to PCR-RFLP in the search for rs12979860 (C/T) and rs8099917 (T/G) single nucleotide polymorphisms (SNPs) in the IFNL4 gene

Performance of the tetra-primer PCR technique compared to PCR-RFLP in the search for rs12979860 (C/T) and rs8099917 (T/G) single nucleotide polymorphisms (SNPs) in the IFNL4 gene

Improving efficiency of fitting Cox proportional hazards models for time-to-event outcomes in genome-wide association studies (GWAS).

Allelic Variants of HLA-C Upstream Region, PSORS1C3, MICA, TNFA and Genes Involved in Epidermal Homeostasis and Barrier Function Influence the Clinical Response to Anti-IL-12/IL-23 Treatment of Patients with Psoriasis.

Functional analysis of intergenic regulatory regions of genes encoding surface adhesins in Staphylococcus aureus isolates from periprosthetic joint infections

Unveiling biogeographical patterns in the worldwide distributed Ceratitis capitata (medfly) using population genomics and microbiome composition.

Decomposition of Individual SNP Patterns from Mixed DNA Samples

Characterizing Fractal Genetic Variation in the Human Genome from the Hapmap Project.

An assembly-free method of phylogeny reconstruction using short-read sequences from pooled samples without barcodes

The development of unlabeled probes-high resolution melting (UP-HRM) marker on SAD, IAA27 and ACC genes of oil palm

SARS-CoV-2 genomic diversity and the implications for qRT-PCR diagnostics and transmission.

HLA-Cw6 and other HLA-C alleles, as well as MICB-DT, DDX58, and TYK2 genetic variants associate with optimal response to anti-IL-17A treatment in patients with psoriasis

Mitochondrial DNA Repair in an Arabidopsis thaliana Uracil N-Glycosylase Mutant.

Mitochondrial D-loop informative SNPs in identification of dog’s breed

SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome.

Deletion of biosynthetic genes, specific SNP patterns and differences in transcript accumulation cause variation in hydroxynitrile glucoside content in barley cultivars

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Single Nucleotide Polymorphism Patterns Research Articles

Related Topics

Articles published on Single Nucleotide Polymorphism Patterns

Single Nucleotide Polymorphism-based Identification of Bacterial Artificial Chromosome-mediated Homologous Recombination.

TNFα rs1800629 Polymorphism and Response to Anti-TNFα Treatment in Behçet Syndrome: Data from an Italian Cohort Study.

Going beyond consensus genome sequences: An innovative SNP-based methodology reconstructs different Ugandan cassava brown streak virus haplotypes at a nationwide scale inRwanda.

Evaluation of SNP-Based Markers Utilization for Resistance to Fall Armyworm Spodoptera frugiperda on Eight Corn Varieties

Performance of the tetra-primer PCR technique compared to PCR-RFLP in the search for rs12979860 (C/T) and rs8099917 (T/G) single nucleotide polymorphisms (SNPs) in the IFNL4 gene

Performance of the tetra-primer PCR technique compared to PCR-RFLP in the search for rs12979860 (C/T) and rs8099917 (T/G) single nucleotide polymorphisms (SNPs) in the IFNL4 gene

Improving efficiency of fitting Cox proportional hazards models for time-to-event outcomes in genome-wide association studies (GWAS).

Allelic Variants of HLA-C Upstream Region, PSORS1C3, MICA, TNFA and Genes Involved in Epidermal Homeostasis and Barrier Function Influence the Clinical Response to Anti-IL-12/IL-23 Treatment of Patients with Psoriasis.

Functional analysis of intergenic regulatory regions of genes encoding surface adhesins in Staphylococcus aureus isolates from periprosthetic joint infections

Unveiling biogeographical patterns in the worldwide distributed Ceratitis capitata (medfly) using population genomics and microbiome composition.

Decomposition of Individual SNP Patterns from Mixed DNA Samples

Characterizing Fractal Genetic Variation in the Human Genome from the Hapmap Project.

An assembly-free method of phylogeny reconstruction using short-read sequences from pooled samples without barcodes

The development of unlabeled probes-high resolution melting (UP-HRM) marker on SAD, IAA27 and ACC genes of oil palm

SARS-CoV-2 genomic diversity and the implications for qRT-PCR diagnostics and transmission.

HLA-Cw6 and other HLA-C alleles, as well as MICB-DT, DDX58, and TYK2 genetic variants associate with optimal response to anti-IL-17A treatment in patients with psoriasis

Mitochondrial DNA Repair in an Arabidopsis thaliana Uracil N-Glycosylase Mutant.

Mitochondrial D-loop informative SNPs in identification of dog’s breed

SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome.

Deletion of biosynthetic genes, specific SNP patterns and differences in transcript accumulation cause variation in hydroxynitrile glucoside content in barley cultivars