Southern Chinese Han Population Research Articles

Inferring the number of contributors (NoC) is a crucial step in interpreting DNA mixtures, as it directly affects the accuracy of the likelihood ratio calculation and the assessment of evidence strength. However, obtaining the correct NoC in complex DNA mixtures remains challenging because of the high degree of allele sharing and dropout. This study analyzed the impact of allele sharing and dropout on NoC inference in complex DNA mixtures when using microhaplotypes (MH). The effectiveness and value of highly polymorphic MH for NoC inference in complex DNA mixtures were evaluated by comparing the performance of three NoC inference methods: the maximum allele count (MAC) method, the maximum likelihood estimation (MLE) method, and the random forest classification (RFC) algorithm. We selected the 100 most polymorphic MH from the Southern Han Chinese (CHS) population and simulated over 40 million complex DNA mixture profiles with the NoC ranging from 2 to 8. These profiles involved unrelated individuals (RM type) and related pairs of individuals, including parent-offspring pairs (PO type), full-sibling pairs (FS type), and second-degree kinship pairs (SE type). Our results showed how the number of detected alleles in DNA mixture profiles varied with marker polymorphism, the involvement of kinship, NoC, and dropout settings. Across the different types of DNA mixtures, the MAC and MLE methods performed best in the RM type, followed by the SE, FS, and PO types, while the RFC models performed best in the PO type, followed by the RM, SE, and FS types. The recall of all three methods for NoC inference decreased as the NoC and dropout levels increased. Furthermore, the MLE method performed better at low NoC, whereas the RFC models excelled at high NoC and/or high dropout levels, regardless of the availability of a priori information about related pairs of individuals in DNA mixtures. However, the RFC models that considered this a priori information and were trained specifically on each type of DNA mixture profile outperformed the RFC_ALL model, which did not consider such information. Finally, we provided recommendations for model building when applying machine learning algorithms to NoC inference.
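
As a rough illustration of the simplest of the three methods, the MAC rule can be sketched in a few lines of Python. The sketch assumes diploid contributors, so each person contributes at most two alleles per marker and the minimum NoC is the largest per-marker allele count divided by two, rounded up. The function name `mac_noc`, the dictionary layout, and the marker and allele labels are hypothetical and are not taken from the study.

```python
import math


def mac_noc(profile):
    """Minimum number of contributors by the maximum allele count rule.

    `profile` maps a marker name to the collection of alleles detected
    at that marker (hypothetical layout, for illustration only).
    Assumes diploid contributors: at most two alleles per person per marker.
    """
    # Largest number of distinct alleles seen at any single marker.
    max_alleles = max((len(set(alleles)) for alleles in profile.values()), default=0)
    return math.ceil(max_alleles / 2)


# Toy mixture: five distinct alleles at one marker need at least three people.
mixture = {
    "MH_A": {"a1", "a2", "a3"},
    "MH_B": {"b1", "b2", "b3", "b4", "b5"},
}
print(mac_noc(mixture))  # 3

# If one allele at MH_B drops out (or is shared), the estimate falls to 2.
with_dropout = {
    "MH_A": {"a1", "a2", "a3"},
    "MH_B": {"b1", "b2", "b3", "b4"},
}
print(mac_noc(with_dropout))  # 2
```

The second call shows why the rule gives only a lower bound: allele sharing and dropout both reduce the number of detected alleles, so the estimate can fall below the true NoC.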

Maoming is located in the southwest of Guangdong Province and is the cradle of Gaoliang culture, a representative branch of the Lingnan cultures. Historical records show that amalgamation between Gaoliang aborigines and distinct ethnic minorities had some influence on the shaping of Gaoliang culture, especially in the case of the local Tai-Kadai language-speaking Baiyue and Han Chinese from Central China. However, there is still no exact genetic evidence for these influences on the gene pool of Maoming Han, and the genetic relationships between Maoming Han and other Chinese populations remain unclear. Hence, to better understand the paternal genetic structure and characterize the forensic features of 27 Y-chromosomal short tandem repeats (Y-STRs) in Han Chinese from Maoming, Guangdong, we first applied the AmpFLSTR® Yfiler® Plus PCR Amplification Kit (Thermo Fisher Scientific, Waltham, MA, United States) to genotype the haplotypes of 431 Han males residing in Maoming. A total of 263 different alleles were identified across all 27 Y-STRs, with allelic frequencies ranging from 0.0004 to 0.7401, and genetic diversity (GD) ranged from 0.4027 (DYS391) to 0.9596 (DYS385a/b). In this first batch of 27-locus Yfiler data from Maoming Han, 417 distinct haplotypes were found, and nine off-ladder alleles were identified at six Y-STRs; no copy number variant or null allele was detected. The overall haplotype diversity (HD) and discrimination capacity (DC) of the 27 Yfiler loci were 0.9997 and 0.9675, respectively, demonstrating that the 6-dye, 27-plex system is sufficiently effective for forensic applications in Maoming Han. Moreover, the phylogenetic analyses indicated that Maoming Han, a Southern Han Chinese population, is closely related to Meizhou Kejia, which suggests that gene flow from surrounding Han populations played a non-negligible role in shaping the gene pool of Maoming Han. From the perspectives of genetics, linguistics, and geography, the genetic structures of Han populations correspond to patterns of geographical distribution and to the relationships among language families. Nevertheless, the present study found no exact genetic evidence for close relationships between Maoming Han and either Tai-Kadai language-speaking populations or Han populations of the Central Plains.
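
The HD and DC figures reported above can be illustrated with a short Python sketch. It assumes the commonly used definitions, which the abstract does not spell out: DC is the number of distinct haplotypes divided by the sample size, and HD is Nei's estimator, n/(n-1) × (1 − Σ pᵢ²), where pᵢ is the frequency of the i-th haplotype. The function name `haplotype_stats` and the toy data are hypothetical.

```python
from collections import Counter


def haplotype_stats(haplotypes):
    """Return (HD, DC) for a list of hashable haplotypes.

    Assumed definitions (not confirmed by the abstract):
      DC = distinct haplotypes / sample size
      HD = n / (n - 1) * (1 - sum of squared haplotype frequencies)
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two haplotypes")
    counts = Counter(haplotypes)
    dc = len(counts) / n
    hd = n / (n - 1) * (1 - sum((c / n) ** 2 for c in counts.values()))
    return hd, dc


# Toy sample: four males, two of whom share a haplotype.
toy = [("14", "10"), ("14", "10"), ("15", "11"), ("13", "10")]
hd, dc = haplotype_stats(toy)
print(round(hd, 4), dc)  # 0.8333 0.75

# Under this definition, 417 distinct haplotypes in 431 males gives the reported DC.
print(round(417 / 431, 4))  # 0.9675
```

HD depends on how the shared haplotypes are distributed among the 431 males, which the abstract does not give, so only the DC value can be checked from the reported counts.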

Articles published on Southern Chinese Han Population

Using simulated microhaplotype genotyping data to evaluate the value of machine learning algorithms for inferring DNA mixture contributor numbers

The polymorphisms of miR-146a SNPs are associated with asthma in Southern Chinese Han population

ITPKC polymorphism (rs7251246 T > C), coronary artery aneurysms, and thrombosis in patients with Kawasaki disease in a Southern Han Chinese population.

Insight into forensic efficiency and genetic structure of the Guizhou Dong group via a 64-plex panel

HSP70 and TNF Loci Polymorphism Associated with the Posner-Schlossman Syndrome in a Southern Chinese Population.

Ocular phenotype related SNP analysis in Southern Han Chinese population from Guangdong province

HAAO rs3816183 Polymorphisms [T] Increase Anterior/Middle Hypospadias Risk in Southern Han Chinese Population.

Polymorphisms of SLC11A1(NRAMP1) rs17235409 associated with and susceptibility to spinal tuberculosis in a southern Han Chinese population.

Genome-wide association study of serum tumor markers in Southern Chinese Han population

Association study of a genetic variant in the long intergenic noncoding RNA (linc01080) with schizophrenia in Han Chinese

C5orf66 rs4976270/rs639933 Are Associated with Colorectal Cancer Risk in Southern Chinese Han Population: A Case-Control Study

Effect of SYTL3-SLC22A3 Variants, Their Haplotypes, and G × E Interactions on Serum Lipid Levels and the Risk of Coronary Artery Disease and Ischaemic Stroke

Gene polymorphisms of LGALS2, LGALS3 and LGALS9 in patients with rheumatoid arthritis

Insights Into Forensic Features and Genetic Structures of Guangdong Maoming Han Based on 27 Y-STRs.

HLA Risk Alleles in Aromatic Antiepileptic Drug-Induced Maculopapular Exanthema.

Increased hypospadias risk by GREM1 rs3743104[G] in the southern Han Chinese population.

A nonsynonymous polymorphism (rs117179004, T392M) of hyaluronidase 1 (HYAL1) is associated with increased risk of idiopathic pulmonary fibrosis in Southern Han Chinese

The role of NOTCH3 variants in Alzheimer's disease and subcortical vascular dementia in the Chinese population

Mutations in the sodium channel genes SCN1A, SCN3A, and SCN9A in children with epilepsy with febrile seizures plus(EFS+)

Associations between SNP83 of phosphodiesterase 4D gene and carotid atherosclerosis in a southern Chinese Han population: a case-control study.
