Single Nucleotide Polymorphism Datasets Research Articles

The objective of this study was to assess how using, or not using, the genotypes of a cow's parents when imputing single nucleotide polymorphisms (SNP) affects the estimation of the cow's genomic inbreeding coefficient. Imputation genotypes (i.e., genotyped plus imputed SNP) from 68,127 Italian Holstein dairy cows registered in the Italian National Association of Holstein, Brown and Jersey Breeders (ANAFIBJ) were analyzed. Cows were genotyped with the high-density (HD) Illumina Infinium BovineHD BeadChip and GeneSeek Genomic Profiler HD-150K panels, and with the medium-density (MD) GeneSeek Genomic Profiler 3, GeneSeek Genomic Profiler 4, GeneSeek MD and Labogena MD panels. To assess differences among estimators, genomic inbreeding coefficients were estimated with four PLINK v1.9 estimators (F, Fhat1, Fhat2 and Fhat3), two genomic relationship matrix (GRM) based estimators (Fgrm and Fgrm2, the latter also including pedigree information) and one estimator based on runs of homozygosity (ROH; FROH). Assuming that the correct genomic inbreeding coefficients are those estimated from genotyped SNP, the coefficients estimated with the genotyped SNP were compared with those estimated with the SNP after imputation. The effect of the presence or absence of genotypic information from the sire, dam and maternal grandsire during imputation was investigated. Genomic inbreeding coefficients estimated with genotyped SNP and with SNP after imputation were consistent for F, Fhat3, Fgrm2 and FROH when at least one of the parents was genotyped. Biased (mainly higher) genomic inbreeding coefficients from imputation SNP, relative to those expected from actual genotype data, were observed in cows that were genotyped with MD SNP panels whose SNP were poorly represented in the selected imputation SNP data set and whose parents were not genotyped. For cows genotyped with MD panels, the estimators Fhat1, Fhat2 and Fgrm gave higher genomic inbreeding coefficients from imputation SNP even when both parents and the maternal grandsire were genotyped. Overall, FROH was the most robust estimator, followed by F and Fhat3. Our findings suggest that SNP selection, parental genotyping and the choice of estimator should be considered when designing imputation strategies for estimating genomic inbreeding with imputation SNP in dairy cattle. For computing genomic inbreeding coefficients, it is advisable to have at least one parent genotyped and to use an ROH-based estimator.
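The abstract names the estimators but does not give their formulas. As a rough illustration only, the sketch below uses the forms commonly documented for PLINK v1.9 and GCTA: F as the excess of observed over expected homozygosity, Fhat1 as the variance-standardized relationship minus 1, Fhat2 as an excess-homozygosity estimator, Fhat3 as the correlation between uniting gametes, and FROH as the fraction of the genome covered by ROH. The function names, the 0/1/2 allele-count coding and the skipping of monomorphic SNP are assumptions made for this example, not the authors' pipeline; PLINK's small-sample correction, ROH detection itself, and the GRM-based Fgrm and Fgrm2 are not reproduced.

```python
from typing import Dict, List, Sequence, Tuple


def allele_frequencies(genotypes: Sequence[Sequence[int]]) -> List[float]:
    """Frequency of the counted allele per SNP.

    genotypes[i][j] is the allele count (0, 1 or 2) of animal i at SNP j.
    Missing genotypes are not handled in this sketch.
    """
    n_animals = len(genotypes)
    n_snps = len(genotypes[0])
    return [
        sum(row[j] for row in genotypes) / (2.0 * n_animals)
        for j in range(n_snps)
    ]


def inbreeding_estimates(x: Sequence[int], p: Sequence[float]) -> Dict[str, float]:
    """Per-animal estimates from allele counts x and allele frequencies p.

    Monomorphic SNP (p == 0 or p == 1) are skipped (an assumption of this
    sketch) because the denominator 2p(1-p) would be zero.
    """
    fhat1 = fhat2 = fhat3 = 0.0
    obs_hom = exp_hom = 0.0
    used = 0
    for xi, pi in zip(x, p):
        h = 2.0 * pi * (1.0 - pi)  # expected heterozygosity under Hardy-Weinberg
        if h == 0.0:
            continue
        used += 1
        fhat1 += (xi - 2.0 * pi) ** 2 / h - 1.0
        fhat2 += 1.0 - xi * (2.0 - xi) / h
        fhat3 += (xi * xi - (1.0 + 2.0 * pi) * xi + 2.0 * pi * pi) / h
        obs_hom += 1.0 if xi != 1 else 0.0  # 0 and 2 are homozygous calls
        exp_hom += 1.0 - h
    if used == 0:
        raise ValueError("no polymorphic SNP available")
    return {
        # (observed - expected homozygous count) / (SNP used - expected)
        "F": (obs_hom - exp_hom) / (used - exp_hom),
        "Fhat1": fhat1 / used,
        "Fhat2": fhat2 / used,
        "Fhat3": fhat3 / used,
    }


def f_roh(roh_segments_bp: Sequence[Tuple[int, int]], genome_length_bp: int) -> float:
    """Fraction of the covered genome lying in ROH; segments are (start, end) in bp."""
    return sum(end - start for start, end in roh_segments_bp) / genome_length_bp
```

With every allele frequency at 0.5, an animal that is heterozygous at every SNP gets -1 from all four estimators and an animal that is homozygous at every SNP gets +1. In this formulation Fhat3 is the average of Fhat1 and Fhat2 at each SNP.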

Biogeographical relationships between the Canary Islands and north-west Africa are often explained by oceanic dispersal and geographical proximity. Sister-group relationships between Canarian and eastern African/Arabian taxa, the 'Rand Flora' pattern, are rare among plants and have been attributed to the extinction of north-western African populations. Euphorbia balsamifera is the only representative species of this pattern that is distributed in the Canary Islands and north-west Africa; it is also one of few species present in all seven islands. Previous studies placed African populations of E. balsamifera as sister to the Canarian populations, but this relationship was based on herbarium samples with highly degraded DNA. Here, we test the extinction hypothesis by sampling new continental populations; we also expand the Canarian sampling to examine the dynamics of island colonization and diversification. Using target enrichment with genome skimming, we reconstructed phylogenetic relationships within E. balsamifera and between this species and its disjunct relatives. A single nucleotide polymorphism dataset obtained from the target sequences was used to infer population genetic diversity patterns. We used convolutional neural networks to discriminate among alternative Canary Islands colonization scenarios. The results confirmed the Rand Flora sister-group relationship between western E. balsamifera and Euphorbia adenensis in the Eritreo-Arabian region and recovered an eastern-western geographical structure among E. balsamifera Canarian populations. Convolutional neural networks supported a scenario of east-to-west island colonization, followed by population extinctions in Lanzarote and Fuerteventura and recolonization from Tenerife and Gran Canaria; a signal of admixture between the eastern island and north-west African populations was recovered. Our findings support the Surfing Syngameon Hypothesis for the colonization of the Canary Islands by E. balsamifera, but also a recent back-colonization to the continent. Populations of E. balsamifera from northwest Africa are not the remnants of an ancestral continental stock, but originated from migration events from Lanzarote and Fuerteventura. This is further evidence that oceanic archipelagos are not a sink for biodiversity, but may be a source of new genetic variability.

Articles published on Single Nucleotide Polymorphism Datasets

Genotyping Error Detection and Customised Filtration for SNP Datasets.

CannSeek? Yes we Can! An open-source single nucleotide polymorphism database and analysis portal for Cannabis sativa.

A newly developed 20 K SNP array reveals QTLs for disease resistance to Cryptocaryon irritans in tiger pufferfish (Takifugu rubripes)

Genetic diversity of the Turkish accessions of two progenitor species, Triticum baeoticum Boiss. and Triticum urartu Thum. ex Gandil., using DArTSeq markers

SnpAIMeR: R package for evaluating ancestry informative marker contributions in non-model population diagnostics.

Assessing the reproducibility of machine-learning-based biomarker discovery in Parkinson’s disease

Genomic inbreeding coefficients using imputation genotypes: Assessing the effect of ancestral genotyping in Holstein-Friesian dairy cows

Influence of molecular marker type on estimating effective population size and other genetic parameters in a critically endangered parrot.

The sweet tabaiba or there and back again: phylogeographical history of the Macaronesian Euphorbia balsamifera.

Ecological opportunity leads to higher diversity and probability of trophic specialization in Arctic charr

Drought-induced growth phenotypes are associated with genetic variation across a white pine hybrid zone

A new wild emmer wheat panel allows to map new loci associated with resistance to stem rust at seedling stage.

Dataset of single nucleotide polymorphisms of immune-associated genes in patients with SARS-CoV-2 infection.

Combined reference-free and multi-reference based GWAS uncover cryptic variation underlying rapid adaptation in a fungal plant pathogen.

Genome-wide association studies using multi-models and multi-SNP datasets provide new insights into pasmo resistance in flax.

The genome-wide meiotic recombination landscape in ciliates and its implications for crossover regulation and genome evolution

Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1

Genetic diversity and signature of divergence in the genome of grapevine clones of Southern Italy varieties.

A review of machine learning models applied to genomic prediction in animal breeding.

Speciation patterns of related species under the hybrid zone: A case study of three sclerophyllous oaks in the east Himalaya-Hengduan Mountains.
