Abstract

Genomic structural variants (SVs) are a major source of genetic and phenotypic variation but have not been investigated systematically in rainbow trout (Oncorhynchus mykiss), an important aquaculture species of cold freshwater. The objectives of this study were 1) to identify and validate high-confidence SVs in rainbow trout using whole-genome re-sequencing; and 2) to examine the contribution of transposable elements (TEs) to SVs in rainbow trout. A total of 96 rainbow trout, including 11 homozygous lines and 85 outbred fish from three breeding populations, were whole-genome sequenced with an average genome coverage of 17.2×. Putative SVs were identified using the program Smoove which integrates LUMPY and other associated tools into one package. After rigorous filtering, 13,863 high-confidence SVs were identified. Pacific Biosciences long-reads of Arlee, one of the homozygous lines used for SV detection, validated 98% (3,948 of 4,030) of the high-confidence SVs identified in the Arlee homozygous line. Based on principal component analysis, the 85 outbred fish clustered into three groups consistent with their populations of origin, further indicating that the high-confidence SVs identified in this study are robust. The repetitive DNA content of the high-confidence SV sequences was 86.5%, which is much higher than the 57.1% repetitive DNA content of the reference genome, and is also higher than the repetitive DNA content of Atlantic salmon SVs reported previously. TEs thus contribute substantially to SVs in rainbow trout as TEs make up the majority of repetitive sequences. Hundreds of the high-confidence SVs were annotated as exon-loss or gene-fusion variants, and may have phenotypic effects. The high-confidence SVs reported in this study provide a foundation for further rainbow trout SV studies.

Highlights

  • Structural variants (SVs) refer to sequence variants greater than 50 bp in size (Alkan et al, 2011; Ho et al, 2020)

  • This study reported that 46 SVs were significantly associated with the annual variance of sea surface temperature while single nucleotide polymorphism (SNP) failed to reveal population adaption to local environments

  • Reads mapped to the 29 chromosomes were used for SV detection as described below, and scaffolds that cannot be assigned to the 29 chromosomes and mitochondria sequence were excluded from SV detection

Read more

Summary

Introduction

Structural variants (SVs) refer to sequence variants greater than 50 bp in size (Alkan et al, 2011; Ho et al, 2020). SVs are a major source of genetic variation, and affect a larger proportion of the human genome than single nucleotide polymorphism (SNP) and any other genetic variants (Sudmant et al, 2015). It is essential to study SVs in order to explore the full spectrum of genetic variation. Based on whole-genome sequences of 492 Atlantic salmon (Salmo salar L.), 15,483 high-confidence SVs were identified (Bertolotti et al, 2020). This study reported that 46 SVs were significantly associated with the annual variance of sea surface temperature while SNPs failed to reveal population adaption to local environments

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call