Hair shed by domestic cats is a potentially useful source of forensic evidence. Analysable hair DNA is predominantly mitochondrial, but the recent domestication history of cats means that mtDNA diversity is low. A 402-bp control region segment is usually sequenced, defining only a small number of distinct haplotypes in populations. Previously, we used a long-amplicon approach to sequence whole mitogenomes in a sample of blood DNAs from 119 UK cats, greatly increasing observed diversity and reducing random match probabilities. To exploit this variation for forensic analysis, we here describe a multiplex system that amplifies the cat mitogenome in 60 overlapping amplicons of mean length 360 bp, followed by Nanopore sequencing. Variants detected in multiplex sequence data from unrooted hair completely mirror those from long-amplicon data from blood from the same individuals. However, applying the multiplex to matched blood DNA reveals additional sequence variants which derive from the major feline nuclear mitochondrial insertion sequence (numt), which covers 7.9 kb of the 17-kb mitogenome and exists in multiple tandem copies. We use long-amplicon Nanopore sequencing to investigate numt variation in a set of cats, together with an analysis of published genome sequences, and show that numt arrays are variable in both structure and sequence, thus providing a potential source of uncertainty when nuclear DNA predominates in a sample. Forensic application of the multiplex was demonstrated by matching hairs from a cat with skeletal remains from its putative mother, both of which shared a globally common haplotype at the control region. The random match probability in this case with the CR 402-bp segment was 0.21 and this decreased to 0.03 when considering the whole mitogenome. The developed multiplex and sequencing approach, when applied to cat hair where nuclear DNA is scarce, can provide a reliable and highly discriminating source of forensic genetic evidence from a single hair. The confounding effect of numt co-amplification in degraded samples where mixed sequences are observed can be mitigated by variant phasing, and by comparison with numt sequence diversity data, such as those presented here.
Read full abstract