Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures

Chris Thachuk,Jorge Franco,Guy F Davenport,Marilyn Warburton,Susanne Dreisigacker,José Crossa

doi:10.1186/1471-2105-10-243

Abstract

BackgroundExisting algorithms and methods for forming diverse core subsets currently address either allele representativeness (breeder's preference) or allele richness (taxonomist's preference). The main objective of this paper is to propose a powerful yet flexible algorithm capable of selecting core subsets that have high average genetic distance between accessions, or rich genetic diversity overall, or a combination of both.ResultsWe present Core Hunter, an advanced stochastic local search algorithm for selecting core subsets. Core Hunter is able to find core subsets having more genetic diversity and better average genetic distance than the current state-of-the-art algorithms for all genetic distance and diversity measures we evaluated. Furthermore, Core Hunter can attempt to optimize any number of genetic measures simultaneously, based on the preference of the user. Notably, Core Hunter is able to select significantly smaller core subsets, which retain all unique alleles from a reference collection, than state-of-the-art algorithms.ConclusionCore Hunter is a highly effective and flexible tool for sampling genetic resources and establishing core subsets. Our implementation, documentation, and source code for Core Hunter is available at

Highlights

IntroductionGenetic resources stored in gene banks are usually sampled with the purpose of evaluating and utilizing them efficiently, as well as studying phenotypic and genotypic diversity, identifying duplicate accessions, and forming core subsets
Existing algorithms and methods for forming diverse core subsets currently address either allele representativeness or allele richness
We have demonstrated that our proposed algorithm for core subset selection, Core Hunter, has improved upon state-of-the-art selection methodologies in several ways

Summary

Introduction

Genetic resources stored in gene banks are usually sampled with the purpose of evaluating and utilizing them efficiently, as well as studying phenotypic and genotypic diversity, identifying duplicate accessions, and forming core subsets. The aim of the latter activity is to preserve in the sample as much of the diversity present in the original collection as possible. Core subset selection can be based on varying criteria including phenotypic traits or various forms of molecular marker data including, but not limited to, single nucleotide polymorphisms (SNP), amplified fragment length polymorphisms (AFLP), random amplified polymorphic DNA (RAPD), and simple sequence repeats (SSR).

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Aug 6, 2009
Citations: 160	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

On Genetic Information, Diversity and Distance
I Vajda ... J Zvárová
Methods of Information in Medicine | VOL. 45
I Vajda, et. al.I Vajda ... J Zvárová
01 Jan 2006
Methods of Information in Medicine | VOL. 45

Genetic diversity and population structure among six cattle breeds in South Africa using a whole genome SNP panel.
Sithembile O Makina ... Azwihangwisi Maiwashe
Frontiers in genetics | VOL. 5
Sithembile O Makina, et. al.Sithembile O Makina ... Azwihangwisi Maiwashe
22 Sep 2014
Frontiers in genetics | VOL. 5

Comparison of optimization methods for core subset selection from a large collection of Mexican wheat landraces characterized by SNP markers
Carlos L Acuña-Matamoros ... M Humberto Reyes-Valdés
Plant Genetic Resources: Characterization and Utilization | VOL. 16
Carlos L Acuña-Matamoros, et. al.Carlos L Acuña-Matamoros ... M Humberto Reyes-Valdés
25 Sep 2017
Plant Genetic Resources: Characterization and Utilization | VOL. 16

Genetic Diversity and Population Structure of Local Chicken Ecotypes in Burkina Faso Using Microsatellite Markers.
Zare Yacouba ... Nianogo A Joseph
Genes | VOL. 13
Zare Yacouba, et. al.Zare Yacouba ... Nianogo A Joseph
24 Aug 2022
Genes | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics