Though investigations into the use of massively parallel sequencing technologies for the generation of complete mitochondrial genome (mtGenome) profiles from difficult forensic specimens are well underway in multiple laboratories, the high quality population reference data necessary to support full mtGenome typing in the forensic context are lacking. To address this deficiency, we have developed 588 complete mtGenome haplotypes, spanning three U.S. population groups (African American, Caucasian and Hispanic) from anonymized, randomly-sampled specimens. Data production utilized an 8-amplicon, 135 sequencing reaction Sanger-based protocol, performed in semi-automated fashion on robotic instrumentation. Data review followed an intensive multi-step strategy that included a minimum of three independent reviews of the raw data at two laboratories; repeat screenings of all insertions, deletions, heteroplasmies, transversions and any additional private mutations; and a check for phylogenetic feasibility. For all three populations, nearly complete resolution of the haplotypes was achieved with full mtGenome sequences: 90.3-98.8% of haplotypes were unique per population, an improvement of 7.7-29.2% over control region sequencing alone, and zero haplotypes overlapped between populations. Inferred maternal biogeographic ancestry frequencies for each population and heteroplasmy rates in the control region were generally consistent with published datasets. In the coding region, nearly 90% of individuals exhibited length heteroplasmy in the 12418-12425 adenine homopolymer; and despite a relatively high rate of point heteroplasmy (23.8% of individuals across the entire molecule), coding region point heteroplasmies shared by more than one individual were notably absent, and transversion-type heteroplasmies were extremely rare. The ratio of nonsynonymous to synonymous changes among point heteroplasmies in the protein-coding genes (1:1.3) and average pathogenicity scores in comparison to data reported for complete substitutions in previous studies seem to provide some additional support for the role of purifying selection in the evolution of the human mtGenome. Overall, these thoroughly vetted full mtGenome population reference data can serve as a standard against which the quality and features of future mtGenome datasets (especially those developed via massively parallel sequencing) may be evaluated, and will provide a solid foundation for the generation of complete mtGenome haplotype frequency estimates for forensic applications.
DNA methylation is known as a biomarker for age with applications in forensics. Here we describe the VISAGE (VISible Attributes through GEnomics) Consortium’s enhanced tool for epigenetic age estimation in somatic tissues. The tool is based on eight DNA methylation markers (44 CpGs), bisulfite multiplex PCR followed by sequencing on the MiSeq FGx platform, and three statistical prediction models for blood, buccal cells and bones. The model for blood is based on six CpGs from
ELOVL2
,
MIR29B2CHG
,
KLF14
,
FHL2
,
TRIM59
and
PDE4C
, and predicts age with a mean absolute error (MAE) of 3.2 years, while the model for buccal cells includes five CpGs from
PDE4C
,
MIR29B2CHG
,
ELOVL2
,
KLF14
and
EDARADD
and predicts age with MAE of 3.7 years, and the model for bones has six CpGs from
ELOVL2, KLF14, PDE4C
and
ASPA
and predicts age with MAE of 3.4 years. The VISAGE enhanced tool for age estimation in somatic tissues enables reliable collection of DNA methylation data from small amounts of DNA using a sensitive multiplex MPS assay that provides accurate estimation of age in blood, buccal swabs, and bones using the statistical model tailored to each tissue.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.