The Han populations represent the largest ethnic group in China. Previous studies have primarily focused on investigating their genetic origins, migration and integration, as well as paternal genetic relationships within specific regional Han populations. However, a comprehensive analysis of the global paternal genetic structure of Han populations is lacking. In this study, we performed Y-chromosome sequencing on 362 unrelated male samples from Chinese Han individuals collected from Qinghai, Sichuan and Liaoning provinces. We then integrated relevant data from reported studies. Our final dataset comprised 1830 samples from 16 Han populations across 15 provinces in China, encompassing information on 89 Y-SNPs and 16 Y-STRs. Statistical analyses were conducted to assess Y-STR haplotype diversity (HD) and Y-SNP haplogroup frequencies. Additionally, we employed principal component analysis (PCA), phylogenetic tree and haplotype network to explore genetic differentiation within Han populations and the genetic relationships between Han populations and ethnic minorities surrounding them. Our results demonstrated that the O-M175 haplogroup represents the predominant paternal lineage in Han populations, with frequencies ranging from 60.53% (Qinghai Han) to 92.7% (Guangdong Han). Moreover, the subclades downstream of O-M175 showed distinct regional variations in their distribution patterns. The O2-M122 haplogroup was prevalent in all Han populations and demonstrated a gradual decline in frequency from north to south. Conversely, the distribution frequency of the O1b-M268 haplogroup decreased from south to north, particularly showed significant presence among Han populations in the Lingnan region. Haplogroup O1a-M119 distributed more frequently in the central Han populations. Our findings revealed that Chinese Han populations can be categorized into three subgroups: northern, central, and southern. Notably, there were significant differences among Han in Qinghai and other regions. Regarding the genetic relationships between Han populations and surrounding ethnic minorities, we observed a closer genetic affinity between different Han populations, but northern Han demonstrated a stronger relationship with the Hui ethnic group, while southern Han exhibited a closer connection with the Gelao and Li ethnic groups. In summary, this study presented a systematic analysis of haplogroup distribution, genetic substructure of Han populations and genetic relationships between Han populations and surrounding ethnic minorities based on 89 Y-SNPs and 16 Y-STRs systematically. Our research supplemented valuable insights into population genetics and forensic genetics, and provided data support for the forensic application of Y chromosome. The integration of Y-SNP haplogroups with Y-STR haplotypes offers enhanced understanding of the genetic substructure within Han populations, which holds significance for both population genetics research and forensic science applications.
Read full abstract