Genomics reveals the origins of ancient specimens

Cong, Qian; Shen, Jinhui; Zhang, Jing; Li, Wenlin; Kinch, Lisa N.; Calhoun, John V.; Warren, Andrew; Grishin, Nick V.

doi:10.1101/752121

Cited by 11 publications

(2 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This was to be expected in whole genome shotgun sequences from museum samples because of the post-mortem DNA modifications [25,90]. Overall the sequences obtained from the Monarch kit extracts are comparable with the quality of shotgun sequences from other museum material in the literature, while the age of our extracted specimens is exceptional among studies including dry pinned insect specimens [89,100,101]. Nevertheless, enrichment of these shotgun libraries will be necessary for higher sequencing efficiency [87-89, 98, 102, 103].…”

Section: Plos Onesupporting

confidence: 69%

Advantages of an easy-to-use DNA extraction method for minimal-destructive analysis of collection specimens

2020

View full text Add to dashboard Cite

Here we present and justify an approach for minimal-destructive DNA extraction from historic insect specimens for next generation sequencing applications. An increasing number of studies use insects from museum collections for biodiversity research. However, the availability of specimens for molecular analyses has been limited by the degraded nature of the DNA gained from century-old museum material and the consumptive nature of most DNA extraction procedures. The method described in this manuscript enabled us to successfully extract DNA from specimens as old as 241 years using a minimal-destructive approach. The direct comparison of the DNeasy extraction Kit and the Monarch® PCR & DNA Clean-up Kit showed a significant increase of 17.3-fold higher DNA yield extracted with the Monarch Oligo protocol on average. By using an extraction protocol originally designed for oligonucleotide clean-up, we were able to combine overcoming the restrictions by target fragment size and strand state, with minimising time consumption and labour-intensity. The type specimens used for the minimal-destructive DNA extraction exhibited no significant external change or post-extraction damage, while sufficient DNA was retrieved for analyses.

show abstract

Section: Plos Onesupporting

confidence: 69%

Advantages of an easy-to-use DNA extraction method for minimal-destructive analysis of collection specimens

2020

View full text Add to dashboard Cite

show abstract

“…The most common approaches to estimating sample locations are based on unsupervised genotype clustering or dimensionality reduction techniques. Genetic data from samples of both known and unknown origin are jointly analyzed, and unknown samples are assigned to the location of known individuals with which they share a genotype cluster or region of PC space (Breidenbach et al, 2019; Battey et al, 2018; Cong et al, 2019). However, these methods require an additional mapping from genotype clusters or PC space to geography, and can produce nonsensical results if unknown samples are hybrids or do not originate from any of the sampled reference populations.…”

Section: Introductionmentioning

confidence: 99%

Predicting Geographic Location from Genetic Variation with Deep Neural Networks

Battey

Ralph

Kern

2019

Preprint

View full text Add to dashboard Cite

6Most organisms are more closely related to nearby than distant members of their species, creating 7 spatial autocorrelations in genetic data. This allows us to predict the location of origin of a genetic 8 sample by comparing it to a set of samples of known geographic origin. Here we describe a deep 9 learning method, which we call Locator, to accomplish this task faster and more accurately than 10 existing approaches. In simulations, Locator infers sample location to within 4.1 generations of 11 dispersal and runs at least an order of magnitude faster than a recent model-based approach. 12 We leverage Locator's computational efficiency to predict locations separately in windows across 13 the genome, which allows us to both quantify uncertainty and describe the mosaic ancestry and 14 patterns of geographic mixing that characterize many populations. Applied to whole-genome 15 sequence data from Plasmodium parasites, Anopheles mosquitoes, and global human populations, 16 this approach yields median test errors of 16.9km, 5.7km, and 85km, respectively. 17 Introduction 18In natural populations, local mate selection and dispersal create correlations between geographic 19 location and genetic variation -each individual's genome is a mosaic of material inherited from 20 recent ancestors that are usually geographically nearby. Given a set of genotyped individuals of 21 known geographic provenance, it is therefore possible to predict the location of new samples from 22 genetic information alone (Guillot et al., 2015; Yang et al., 2012; Wasser et al., 2004; Rañola et al., 23 2014; Bhaskar et al., 2016; Baran et al., 2013). This task has forensic applications -for example, 24 estimating the location of trafficked elephant ivory as in Wasser et al. (2004) -and also offers a way 25 to analyze variation in geographic ancestry without assuming the existence of discrete ancestral 26 populations. 27The most common approaches to estimating sample locations are based on unsupervised geno-28 type clustering or dimensionality reduction techniques. Genetic data from samples of both known 29 and unknown origin are jointly analyzed, and unknown samples are assigned to the location of 30 known individuals with which they share a genotype cluster or region of PC space (Breidenbach 31 et al., 2019; Battey et al., 2018; Cong et al., 2019). However, these methods require an additional 32 mapping from genotype clusters or PC space to geography, and can produce nonsensical results if 33 unknown samples are hybrids or do not originate from any of the sampled reference populations. 34Existing methods for estimating sample location that explicitly model continuous landscapes 35 use a two-step procedure. A smoothed map describing variation in allele frequencies over space 36 1 Battey et al. 2019Locator is first estimated for each allele based on the genotypes of individuals with known locations, and 37 locations of new samples are then predicted by maximizing the likelihood of observing a given 38 combination of alleles at the p...

show abstract