DNA metabarcoding is promising for cost-effective biodiversity monitoring, but reliable diversity estimates are difficult to achieve and validate. Here we present and validate a method, called LULU, for removing erroneous molecular operational taxonomic units (OTUs) from community data derived by high-throughput sequencing of amplified marker genes. LULU identifies errors by combining sequence similarity and co-occurrence patterns. To validate the LULU method, we use a unique data set of high quality survey data of vascular plants paired with plant ITS2 metabarcoding data of DNA extracted from soil from 130 sites in Denmark spanning major environmental gradients. OTU tables are produced with several different OTU definition algorithms and subsequently curated with LULU, and validated against field survey data. LULU curation consistently improves α-diversity estimates and other biodiversity metrics, and does not require a sequence reference database; thus, it represents a promising method for reliable biodiversity estimation.
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. In this study, we investigate the role of gender as a factor that contributes to eye colour variation and suggest that the gender effect on eye colour is population specific.A total of 230 Italian individuals were typed for the six IrisPlex SNPs (rs12913832, rs1800407, rs12896399, rs1393350, rs16891982 and rs12203592). A quantitative eye colour score (Pixel Index of the Eye: PIE-score) was calculated based on digital eye images using the custom made DIAT software. The results were compared with those of Danish and Swedish population samples.As expected, we found HERC2 rs12913832 as the main predictor of human eye colour independently of ancestry. Furthermore, we found gender to be significantly associated with quantitative eye colour measurements in the Italian population sample. We found that the association was statistically significant only among Italian individuals typed as heterozygote GA for HERC2 rs12913832. Interestingly, we did not observe the same association in the Danish and Swedish population. This indicated that the gender effect on eye colour is population specific. We estimated the effect of gender on quantitative eye colour in the Italian population sample to be 4.9%. Among gender and the IrisPlex SNPs, gender ranked as the second most important predictor of human eye colour variation in Italians after HERC2 rs12913832. We, furthermore, tested the five lower ranked IrisPlex predictors, and evaluated all possible 3 6 (729) genotype combinations of the IrisPlex assay and their corresponding predictive values using the IrisPlex prediction model [4]. The results suggested that maximum three (rs12913832, rs1800407, rs16891982) of the six IrisPlex SNPs are useful in practical forensic genetic casework.
BackgroundThe color of the eyes is one of the most prominent phenotypes in humans and it is often used to describe the appearance of an individual. The intensity of pigmentation in the iris is strongly associated with one single‐nucleotide polymorphism (SNP), rs12913832:A>G that is located in the promotor region of OCA2 (OMIM #611409). Nevertheless, many eye colors cannot be explained by only considering rs12913832:A>G.MethodsIn this study, we searched for additional variants in OCA2 to explain human eye color by sequencing a 500 kbp region, encompassing OCA2 and its promotor region.ResultsWe identified three nonsynonymous OCA2 variants as important for eye color, including rs1800407:G>A (p.Arg419Gln) and two variants, rs74653330:A>T (p.Ala481Thr) and rs121918166:G>A (p.Val443Ile), not previously described as important for eye color variation. It was shown that estimated haplotypes consisting of four variants (rs12913832:A>G, rs1800407:G>A (p.Arg419Gln), rs74653330:A>T (p.Ala481Thr), and rs121918166:G>A (p.Val443Ile)) explained 75.6% (adjusted R 2 = 0.76) of normal eye color variation, whereas rs12913832:A>G alone explained 68.8% (adjusted R 2 = 0.69). Moreover, rs74653330:A>T (p.Ala481Thr) and rs121918166:G>A (p.Val443Ile) had a measurable effect on quantitative skin color (P = 0.008).ConclusionOur data showed that rs74653330:A>T (p.Ala481Thr) and rs121918166:G>A (p.Val443Ile) have a measurable effect on normal pigmentation variation.
Background Viruses and other infectious agents cause more than 15% of human cancer cases. High-throughput sequencing-based studies of virus-cancer associations have mainly focused on cancer transcriptome data. Methods In this study, we applied a diverse selection of presequencing enrichment methods targeting all major viral groups, to characterize the viruses present in 197 samples from 18 sample types of cancerous origin. Using high-throughput sequencing, we generated 710 datasets constituting 57 billion sequencing reads. Results Detailed in silico investigation of the viral content, including exclusion of viral artefacts, from de novo assembled contigs and individual sequencing reads yielded a map of the viruses detected. Our data reveal a virome dominated by papillomaviruses, anelloviruses, herpesviruses, and parvoviruses. More than half of the included samples contained 1 or more viruses; however, no link between specific viruses and cancer types were found. Conclusions Our study sheds light on viral presence in cancers and provides highly relevant virome data for future reference.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.