The timing of flowering in soybean [Glycine max(L.) Merr.], a key legume crop, is influenced by many factors, including daylight length or photoperiodic sensitivity, that affect crop yield, productivity, and geographical adaptation. Despite its importance, a comprehensive understanding of the local linkage landscape and allelic diversity within regions of the genome influencing flowering and contributing to phenotypic variation in subpopulations has been limited. This study addresses these gaps by conducting an in-depth trait association and linkage analysis coupled with local haplotyping using advanced bioinformatics tools, including crosshap, to characterize genomic variation using a pangenome dataset representing 915 domesticated and wild-type individuals. The association analysis identified eight significant loci on seven chromosomes. Moving beyond traditional association analysis, local haplotyping of targeted regions on chromosomes 6 and 20 identified distinct haplotype structures, variation patterns, and genomic candidates influencing flowering in subpopulations. These results suggest the action of a network of genomic candidates influencing flowering time and an untapped reservoir of genomic variation for this trait in wild germplasm. Notably, GlymaLee.20G147200 on chromosome 20 was identified as a candidate gene that may cause delayed flowering in soybean, potentially through histone modifications of floral repressor loci as seen in Arabidopsis thaliana (L.) Heynh. These findings support future functional validation of haplotype-based alleles for marker-assisted breeding and genomic selection to enhance latitude adaptability of soybean without compromising yield.
Read full abstract