Abstract

Historically, forensic STR panels have been unsuccessful for population assignment due to the limited ancestry information that can be derived from the non-coding STR loci and the low number of loci included in the panel. However, given the recent adoption of expanded (16+ loci) and ‘mega-plex’ (23+ loci) STR panels, the ability to identify source population groups may be improved. This study assessed the impact of increasing locus number on population assignment under different analysis conditions using a published US population dataset comprised of individuals from the African American, Caucasian, Hispanic and Asian populations. The Bayesian clustering programme STRUCTURE was used to assess first, whether increasing the number of loci and the inclusion of known sample population data enabled greater resolution between the four populations in the dataset, and second, the utility for population assignment using criteria based on inferred ancestry scores. Results suggest that increasing the number of loci and including population of origin data allowed the identification of more distinct populations, with three primary populations being observed; African American, Asian, and Caucasian/Hispanic. The close grouping of the Caucasian and Hispanic populations is supported by their recently common ancestry from Western Europe. The ability of the programme to support population assignment to each of the four existing populations was assessed through the application of population and panel specific assignment thresholds based on the inferred ancestry scores obtained from the analysis programme. Predictive accuracy based on a training dataset of 984 individuals suggest that assignment accuracy is > 96% across the four populations and can reach 100% under some test conditions. The accuracy was > 90% when blind testing was performed on 40 ‘unknown’ individuals. As such, the approach described is considered within the acceptable range for a presumptive test and can be performed using data already collected as part of routine forensic investigations.

Highlights

  • Inference of the ethnic origin of a suspect from their DNA recovered from a crime scene sample can act as ‘investigative intelligence’ and help enforcement agencies concentrate their resources in the absence of any other suspect specific information

  • The specific identification of Ancestry Informative Markers (AIMs) panels that are powerful at inferring ethnic origin has been the focus of much research and the assessment of different classes of molecular marker has slowly moved from mitochondrial DNA sequence variation, through autosomal and Y linked Short Tandem Repeat (STRs) markers to Single Nucleotide Polymorphisms (SNPs) [48]

  • The STRUCTURE analysis shows that the number of distinct genetic groups (K) identified varies depending on which method is used in the estimation (Table 1)

Read more

Summary

Introduction

Inference of the ethnic origin of a suspect from their DNA recovered from a crime scene sample can act as ‘investigative intelligence’ and help enforcement agencies concentrate their resources in the absence of any other suspect specific information This idea is not new and has been explored extensively in the literature through the development and application of assignment approaches that use genetic markers to identify unique genetic groups or populations [1,2,3]. The increasing number of STR loci may enable better resolution between populations, making expanded and mega-plex STR kits suitable for genetic differentiation between populations [21] and the inference of ethnic origin Such an approach would be useful if labs continue to use CE methods for the foreseeable future

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call