Abstract

The genetic diversity of plants has traditionally been employed to improve crop plants to suit human needs, and in the future feed the increasing population and protect crops from environmental stresses and climate change. Genome-wide sequencing is a reality and can be used to make association to crop traits to be utilized by high-throughput marker based selection methods. This study describes a strategy of using next generation sequencing (NGS) data from the rice genome to make comparisons to the high-quality reference genome, identify functional polymorphisms within genes that might result in function changes and be used to study correlations to traits and employed in genetic mapping. We analyzed the NGS data of Oryza sativa ssp indica cv. G4 covering 241 Mb with ∼20X coverage and compared to the reference genome of Oryza sativa ssp. japonica to describe the genome-wide distribution of gene-based single nucleotide polymorphisms (SNPs). The analysis shows that the 63% covered genome consists of 1.6 million SNPs with 6.9 SNPs/Kb, and including 80,146 insertions and 92,655 deletions (INDELs) genome-wide. There are a total of 1,139,801 intergenic SNPs, 295,136 SNPs in intronic/non-coding regions, 195,098 in coding regions, 23,242 SNPs at the five-prime (5′) UTR regions and 22,686 SNPs at the three-prime (3′) UTR region. SNP variation was found in 40,761 gene loci, which include 75,262 synonymous and 119,836 non-synonymous changes, and functional reading frame changes through 3,886 inducing STOP-codon (isSNP) and 729 preventing STOP-codon (psSNP) variation. There are quickly evolving 194 high SNP hotspot genes (>100 SNPs/gene), and 1,513 out of 2,458 transcription factors displaying 2,294 non-synonymous SNPs that can be a major source of phenotypic diversity within the species. All data is searchable at https://plantstress-pereira.uark.edu/oryza2. We envision that this strategy will be useful for the identification of genes for crop traits and molecular breeding of rice cultivars.

Highlights

  • The world’s geometrically growing population has passed 7 billion, and it is estimated that rice production must increase 24%by 2025 to meet the global population demand [1]

  • The analysis shows that the 63% covered genome consists of 1.6 million single nucleotide polymorphisms (SNPs) with 6.9 SNPs/Kb, and including 80,146 insertions and 92,655 deletions (INDELs) genome-wide

  • Natural phenotypic differences are amenable to genetic dissection, right up to the identification of causal DNA polymorphisms and functional nucleotide polymorphisms accounting for plant intraspecific developmental diversity

Read more

Summary

Introduction

The world’s geometrically growing population has passed 7 billion, and it is estimated that rice production must increase 24%by 2025 to meet the global population demand [1]. The rice genome, considered as a global heritage, has the information and resources to continue to feed half the population of the world. The diversity of rice across the world is an excellent resource to understand the genetic events of human selection for traits of utility, and to discover new alleles for traits using plant genomics as it represents a modest size of 389 Mb for a cereal genome [2]. Natural phenotypic differences are amenable to genetic dissection, right up to the identification of causal DNA polymorphisms and functional nucleotide polymorphisms accounting for plant intraspecific developmental diversity. These variations often affect gene structure and may contribute to interspecific phenotypic traits and adaptation [10].

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call