Researchers have assembled thousands of eukaryotic genomes using Illumina reads, but traditional mate‐pair libraries cannot span all repetitive elements, resulting in highly fragmented assemblies. However, both chromosome conformation capture techniques, such as Hi‐C and Dovetail Genomics Chicago libraries, and long‐read sequencing, such as Pacific Biosciences and Oxford Nanopore, help span and resolve repetitive regions and therefore improve genome assemblies. One important livestock species of arid regions that does not have a high‐quality contiguous reference genome is the dromedary (Camelus dromedarius). Draft genomes exist but are highly fragmented, and a high‐quality reference genome is needed to understand adaptation to desert environments and artificial selection during domestication. Dromedaries are among the last livestock species to have been domesticated, and together with wild and domestic Bactrian camels, they are the only representatives of the Camelini tribe, which highlights their evolutionary significance. Here we describe our efforts to improve the North African dromedary genome. We used Chicago and Hi‐C sequencing libraries from Dovetail Genomics to resolve the order of previously assembled contigs, producing almost chromosome‐level scaffolds. Remaining gaps were filled with Pacific Biosciences long reads, and then scaffolds were comparatively mapped to chromosomes. Long reads added 99.32 Mbp to the total length of the new assembly. Dovetail Chicago and Hi‐C libraries increased the longest scaffold over 12‐fold, from 9.71 Mbp to 124.99 Mbp, and the scaffold N50 over 50‐fold, from 1.48 Mbp to 75.02 Mbp. We demonstrate that Illumina de novo assemblies can be substantially upgraded by combining chromosome conformation capture and long‐read sequencing.
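The scaffold N50 quoted above is the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the total assembly length. The following is a minimal sketch of that standard definition, not the pipeline used in the study; the function names `n50` and `fold_change` are hypothetical, and the toy scaffold lengths are illustrative only. The fold changes are recomputed from the figures reported in the abstract.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    if not lengths:
        raise ValueError("at least one length is required")
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold downwards until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length


def fold_change(before, after):
    """Ratio of a metric after an upgrade to its value before."""
    return after / before


# Toy lengths (illustrative, not from the study): total = 20, half = 10.
toy_n50 = n50([2, 3, 4, 5, 6])

# Figures reported in the abstract, in Mbp.
longest_fold = fold_change(9.71, 124.99)  # longest scaffold
n50_fold = fold_change(1.48, 75.02)       # scaffold N50
```

With the reported values, the longest scaffold grows by a factor of roughly 12.9 and the scaffold N50 by roughly 50.7, consistent with the "over 12‐fold" and "over 50‐fold" statements.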
The genomic structure of the Major Histocompatibility Complex (MHC) region and variation in selected MHC class I related genes in Old World camels, Camelus bactrianus and Camelus dromedarius, were studied. The overall genomic organization of the camel MHC region follows a general pattern observed in other mammalian species, and individual MHC loci appear to be well conserved. Selected MHC class I genes B‐67 and BL3‐7 exhibited unexpectedly low variability, even when compared to other camel MHC class I related genes MR1 and MICA. Interspecific SNP and allele sharing are relatively common, and frequencies of heterozygotes are usually low. Such low variation in a genomic region generally considered one of the most polymorphic in vertebrate genomes is unusual. Evolutionary relationships between MHC class I related genes and their counterparts from other species seem to be rather complex. Often, they do not follow the general evolutionary history of the species concerned. Close evolutionary relationships of individual MHC class I loci between camels, humans and dogs were observed. Based on the results of this study and on our data on MHC class II genes, the extent and the pattern of polymorphism of the MHC region of Old World camelids differed from most mammalian groups studied so far. Camels thus seem to be an important model for our understanding of the role of genetic diversity in immune functions, especially in the context of unique features of their immunoglobulin and T‐cell receptor genes.
Single nucleotide polymorphisms (SNPs) are replacing microsatellites for population genetic analyses, but it is not apparent how many SNPs are needed or how well SNPs correlate with microsatellites. We used data from the gopher tortoise, Gopherus polyphemus, a species with small populations, to compare SNPs and microsatellites for estimating population genetic parameters. Specifically, we compared one SNP dataset (16 tortoises from 4 populations sequenced at 17,901 SNPs) to two microsatellite datasets, a full dataset of 101 tortoises and a partial dataset of 16 tortoises previously genotyped at 10 microsatellites. For the full microsatellite dataset, observed heterozygosity, expected heterozygosity, and FST were correlated between SNPs and microsatellites; however, allelic richness was not. The same was true for the partial microsatellite dataset, except that allelic richness, but not observed heterozygosity, was correlated. The number of clusters estimated by Structure differed for each dataset (SNPs = 2; partial microsatellite = 3; full microsatellite = 4). PCA showed four clusters for all datasets. More than 800 SNPs were needed to correlate with allelic richness, observed heterozygosity, and expected heterozygosity, but only 100 were needed for FST. The number of SNPs typically obtained from NGS far exceeds the number needed to correlate with microsatellite parameter estimates. Our study illustrates that diversity, FST, and PCA results from microsatellites can mirror those obtained with SNPs. These results may be generally applicable to small populations, a defining feature of endangered and threatened species, because theory predicts that genetic drift will tend to outweigh selection in small populations.
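For readers unfamiliar with the parameters compared above, expected heterozygosity and FST can be illustrated at a single locus. This is a minimal textbook sketch, assuming equally sized subpopulations and known allele frequencies; it is not the estimator used in the study, whose software and estimators are not stated in the abstract. The function names and the example frequencies are hypothetical.

```python
def expected_heterozygosity(freqs):
    """Expected heterozygosity at one locus: He = 1 - sum(p_i^2)."""
    return 1.0 - sum(p * p for p in freqs)


def fst(subpop_freqs):
    """Single-locus FST = (HT - HS) / HT.

    subpop_freqs holds one list of allele frequencies per subpopulation,
    all in the same allele order. Subpopulations are assumed to be of
    equal size, so frequencies are averaged without weighting.
    """
    n = len(subpop_freqs)
    # HS: mean expected heterozygosity within subpopulations.
    hs = sum(expected_heterozygosity(f) for f in subpop_freqs) / n
    # HT: expected heterozygosity of the pooled (mean) allele frequencies.
    pooled = [sum(column) / n for column in zip(*subpop_freqs)]
    ht = expected_heterozygosity(pooled)
    return (ht - hs) / ht if ht > 0 else 0.0


# Hypothetical biallelic locus in two subpopulations.
example_fst = fst([[0.8, 0.2], [0.4, 0.6]])
```

In this hypothetical example HS is 0.40 and HT is 0.48, giving an FST of about 0.17. Two subpopulations with identical frequencies give 0, and two fixed for different alleles give 1.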
Due to the production of special homodimeric heavy chain antibodies, somatic hypermutation of their T-cell receptor genes, and unusually low diversity of their major histocompatibility complex genes, camels represent an important model for immunogenetic studies. Here, we analyzed genes encoding selected natural killer cell receptors with a special focus on genes encoding receptors for major histocompatibility complex (MHC) class I ligands in the two domestic camel species, Camelus dromedarius and Camelus bactrianus. Based on the dromedary genome assembly CamDro2, we characterized the genetic contents, organization, and variability of two complex genomic regions, the leukocyte receptor complex and the natural killer complex, along with the natural cytotoxicity receptor genes NCR1, NCR2, and NCR3. The genomic organization of the natural killer complex region of camels differs from that of cattle, the phylogenetically most closely related species. With its minimal set of KLR genes, it resembles this complex in the domestic pig. Similarly, the leukocyte receptor complex of camels is strikingly different from its cattle counterpart. With KIR pseudogenes and few LILR genes, it seems to be simpler than in the pig. The syntenies and protein sequences of the NCR1, NCR2, and NCR3 genes in the dromedary suggest that they could be human orthologues. However, only NCR1 and NCR2 have a structure of functional genes, while NCR3 appears to be a pseudogene. High sequence similarities between the two camel species as well as with the alpaca Vicugna pacos were observed. The polymorphism in all genes analyzed seems to be generally low, similar to the rest of the camel genomes.
This first report on natural killer cell receptor genes in camelids adds new data to our understanding of specificities of the camel immune system and its functions, extends our genetic knowledge of the innate immune variation in dromedaries and Bactrian camels, and contributes to studies of natural killer cell receptor evolution in mammals.
Dromedaries have been essential for the prosperity of civilizations in arid environments and the dispersal of humans, goods and cultures along ancient, cross-continental trading routes. With increasing desertification their importance as livestock species is rising rapidly, but little is known about their genome-wide diversity and demographic history. As previous studies using few nuclear markers found weak phylogeographic structure, here we detected fine-scale population differentiation in dromedaries across Asia and Africa by adopting a genome-wide approach. Global patterns of effective migration rates revealed pathways of dispersal after domestication, following historic caravan routes like the Silk and Incense Roads. Our results show that a Pleistocene bottleneck and Medieval expansions during the rise of the Ottoman empire have shaped genome-wide diversity in modern dromedaries. By understanding subtle population structure we recognize the value of small, locally adapted populations and appeal for securing genomic diversity for a sustainable utilization of this key desert species.