Relevant for various areas of human genetics, Y-chromosomal short tandem repeats (Y-STRs) are commonly used for testing close paternal relationships among individuals and populations, and for male lineage identification. However, even the widely used 17-loci Yfiler set cannot resolve individuals and populations completely. Here, 52 centers generated quality-controlled data of 13 rapidly mutating (RM) Y-STRs in 14,644 related and unrelated males from 111 worldwide populations. Strikingly, >99% of the 12,272 unrelated males were completely individualized. Haplotype diversity was extremely high (global: 0.9999985, regional: 0.99836–0.9999988). Haplotype sharing between populations was almost absent except for six (0.05%) of the 12,156 haplotypes. Haplotype sharing within populations was generally rare (0.8% nonunique haplotypes), significantly lower in urban (0.9%) than rural (2.1%) and highest in endogamous groups (14.3%). Analysis of molecular variance revealed 99.98% of variation within populations, 0.018% among populations within groups, and 0.002% among groups. Of the 2,372 newly and 156 previously typed male relative pairs, 29% were differentiated including 27% of the 2,378 father–son pairs. Relative to Yfiler, haplotype diversity was increased in 86% of the populations tested and overall male relative differentiation was raised by 23.5%. Our study demonstrates the value of RM Y-STRs in identifying and separating unrelated and related males and provides a reference database.
The value of the evidence depends critically on propositions. In the second of two papers intended to provide advice to the community on difficult aspects of evaluation and the formulation of propositions, we focus primarily on activity level propositions. This helps the court address the question of "How did an individual's cell material get there?". In order to do this, we expand the framework outlined in the first companion paper. First, it is important not to conflate results and propositions. Statements given activity level propositions aim to help address issues of indirect vs direct transfer, and the time of the activity, but it is important to avoid use of the word 'transfer' in propositions. This is because propositions are assessed by the Court, but DNA transfer is a factor that scientists need to take into account for the interpretation of their results. Suitable activity level propositions are ideally set before knowledge of the results and address issues like: X stabbed Y vs. an unknown person stabbed Y but X met Y the day before. The scientist assigns the probability of the evidence, if each of the alternate propositions is true, to derive a likelihood ratio. To do this, the scientist asks: a) "what are the expectations if each of the propositions is true?" b) "What data are available to assist in the evaluation of the results given the propositions?" When presenting evidence, scientists work within the hierarchy of propositions framework. The value of evidence calculated for a DNA profile cannot be carried over to higher levels in the hierarchy -the calculations given sub-source, source and activity level propositions are all separate. A number of examples are provided to illustrate the principles espoused, and the criteria that such assessments should meet. Ideally in order to assign probabilities, the analyst should have/collect data that are relevant to the case in
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.