Abstract

Non-allelic homologous recombination (NAHR) is one of the key mechanisms of DNA rearrangement. NAHR occurring between direct homologous repeats can generate genomic copy number variation (CNV) and make significant contributions to both genome evolution and human diseases such as cancer. Intriguingly, previous observations on the rare CNVs at certain genomic disorder loci suggested that NAHR frequency could be dependent on homology properties. However, such a correlation remains unclear at the other NAHR-mediated CNV loci, especially the common CNVs in human populations. Different from the rare CNVs associated with genomic disorders, it is challenging to identify de novo NAHR events at common CNV loci. Therefore, our previously proposed statistic M was employed in estimating relative mutation rate for the NAHR-mediated CNVs in human populations. By utilizing generalized regression neural network and principal component analysis in studying 4330 CNVs ascertained in 3 HapMap populations, we identified the CNVs mediated by NAHR between paired segmental duplications (SDs) and further revealed the correlations between SD properties and NAHR probability. SD length and inter-SD distance were shown to make major contributions to the occurrence of NAHR, whereas chromosomal position and sequence similarity of paired SDs are also involved in NAHR. An integrated effect of SD properties on NAHR frequency was revealed for the common CNVs in human populations. These observations can be well explained by ectopic synapsis in NAHR together with our proposed model of chromosomal compression/extension/looping (CCEL) for homology mis-pairing. Our findings showed the important roles of SDs in NAHR and human genomic evolution.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call