Copy number variants (CNVs) are structural variants within the eukaryotic genome that vary among individuals of a species. These variants have been associated with different phenotypic traits, making them a valuable consideration as markers for designing breeding programmes. In this study, whole genome sequence data of 60 dromedary camel samples originating from the Arabian Peninsula were analyzed to construct a comprehensive dromedary CNV map. Utilizing four CNV callers employing read-depth, split-read and paired-end mapping approaches, a total of 37,519 CNV events (17,847 deletions and 19,672 duplications) were called on the dromedary autosomes. These CNV events were merged into 2,557 regions, categorized as 1,322 losses, 122 gains, and 1,113 “mixed regions” comprising both types. The cumulative size of the CNV regions amounted to 22.5 Mb, covering roughly 1.16% of the dromedary autosomes. Approximately 32% of the defined CNV regions (comprising 60% losses, 18% gains, and 0.27% mixed regions) were found in ≥ 90% of the dromedary samples, classifying them as prevalent regions. Genes with biological functions related to the different adaptive physiologies of dromedary camels, such as fertility, heat stress, musculoskeletal development, and fat metabolism, were overlapping with or in close proximity to ~ 68% of the defined CNV regions, demonstrating their potential role in dromedaries’ physiology. This study presents the first comprehensive CNV map of dromedary camels and builds on the present knowledge in understanding the genetic structure of this species.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-024-77773-0.