Genomic variation in 3,010 diverse accessions of Asian cultivated rice

Wensheng Wang,Zichao Li,Hongliang Zhang,Zhen Yue,Min Li,Chunchao Wang,Zhaotong Dong,Xueqiang Wang,J Mendoza ,Jiabao Xu,Yue Zhao,Miao Wang,Locedie Mansueto,Fan Zhang,Jiannong Shi ,Binying Fu,Ben Jia,Millicent D Sanciangco ,Jinyuan Lu,Xiaodong Fang,Ma Elizabeth B Naredo,Frances Nikki Borja,Alexander Poliakov,Chen Sun,Xiao Chen ,Jeffrey Detras,Rui Li,Dario Copetti,Dave Kudrna,Zhiqiang Hu,Jing Li,Qiang Gao,Rod A Wing,Jayson Talag,Inna Dubchak,Yongchao Niu,Nickolai Alexandrov,Tianqing Zheng,Mengjie Zhang ,Yongming Gao,Kevin Palis ,Dmytro Chebotarov,Jianlong Xu,Wushu Hu,Hong Yu,Jean Christophe Glaszmann ,Seung‐Hee Lee ,Yanhong Li,Ruaraidh Sackville Hamilton,Yongli Zhou,Xiuqin Zhao,Zhikang Li,Xiancong He ,Ramil Mauleon,Kenneth L Mcnally,Miaolin Chen,Chaochun Wei,Ye Yin,Zhichao Wu,Fei Shen,Dabing Zhang,Gengyun Zhang,Victor Jun Ulat,Jue Ruan,Jiayang Li,Jianwei Zhang,Jauhar Ali,Shuaishuai Tai,Roven Rommel Fuentes,Hei Leung

doi:10.1038/s41586-018-0063-9

Abstract

Here we analyse genetic variation, population structure and diversity among 3,010 diverse Asian cultivated rice (Oryza sativa L.) genomes from the 3,000 Rice Genomes Project. Our results are consistent with the five major groups previously recognized, but also suggest several unreported subpopulations that correlate with geographic location. We identified 29 million single nucleotide polymorphisms, 2.4 million small indels and over 90,000 structural variations that contribute to within- and between-population variation. Using pan-genome analyses, we identified more than 10,000 novel full-length protein-coding genes and a high number of presence–absence variations. The complex patterns of introgression observed in domestication genes are consistent with multiple independent rice domestication events. The public availability of data from the 3,000 Rice Genomes Project provides a resource for rice genomics research and breeding.

Highlights

We analyse genetic variation, population structure and diversity among 3,010 diverse Asian cultivated rice (Oryza sativa L.) genomes from the 3,000 Rice Genomes Project
Using pan-genome analyses, we identified more than 10,000 novel full-length protein-coding genes and a high number of presence–absence variations
We report pan-genome analyses for O. sativa, and the high numbers of presence–absence variations (PAVs) highlight another component of within-species diversity for rice

Summary

GJ cA cB Admix

Presence frequency 0 0.25 0.50 0.75 1 major-group-unbalanced SVs unevenly distributed among XI, GJ, cA and cB on the basis of two-sided Fisher’s exact tests. In all major groups formed candidate core gene families, and the remaining 9,050 (37.9%) comprised distributed gene families (Fig. 4a, b and Supplementary Data 3 Table 3). The O. sativa pan-genome consists of between 12,770 and approximately 14,826 (53.5% to about 62.1%) core gene families, and at least 9,050 (37.9%) distributed gene families: each accession contains between 63.4% and about 73.5% core gene families and at least 26.5% distributed gene families (Fig. 4b). We found 98.4% of the IR 8 and 98.6% of the N 22 genome sequences could be mapped to the pangenome, whereas only 94.3% and 94.0% could be found in Nipponbare RefSeq. By comparing pan-genome data with high-quality XI reference genomes of Zhenshan 97 and Minghui 6330, approximately 25% of the novel genes were shorter owing to gene predictions from fragmented sequences (Extended Data Fig. 5c, d). We identified 4,270 XI and 1,384 GJ subpopulation-unbalanced gene families, showing variation between subpopulations within each major group (Extended Data Fig. 7g). Correlation between gene PAVs and plant height detected the well-known green revolution gene (sd1) as the first-ranked candidate. sd[1] is classified as a distributed gene—caused by an approximately 385-bp deletion— and is significantly (P value < 10−20) associated with greatly reduced plant height; it was absent most frequently in XI-1A and XI-1B varieties (Extended Data Fig. 11)

Discussion

Methods

Sample size

Randomization

Statistical parameters

Findings

Antibodies

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature	Publication Date: Apr 25, 2018
Citations: 1083	License type: open-access

R Discovery Prime

R Discovery Prime

Genomic variation in 3,010 diverse accessions of Asian cultivated rice

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nature

Lead the way for us

Similar Papers

Self-similar characteristics of single nucleotide polymorphisms in the rice genome
Chang-Yong Lee
Journal of the Korean Physical Society | VOL. 69
Chang-Yong LeeChang-Yong Lee
01 Nov 2016
Journal of the Korean Physical Society | VOL. 69

Rice bioinformatics. analysis of rice sequence data and leveraging the data to other plant species.
Qiaoping Yuan ... Steven L Salzberg
Plant Physiology | VOL. 125
Qiaoping Yuan, et. al.Qiaoping Yuan ... Steven L Salzberg
01 Mar 2001
Plant Physiology | VOL. 125

BGI-RIS: an integrated information resource and comparative analysis workbench for rice genomics
W Zhao
Nucleic Acids Research | VOL. 32
W ZhaoW Zhao
01 Jan 2004
Nucleic Acids Research | VOL. 32

QTL mapping and candidate gene analysis of low temperature germination in rice (Oryza sativa L.) using a genome wide association study.
Feng Mao ... Depeng Wu
PeerJ | VOL. 10
Feng Mao, et. al.Feng Mao ... Depeng Wu
11 May 2022
PeerJ | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Genomic variation in 3,010 diverse accessions of Asian cultivated rice

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nature