Over the past decade, the spotted wing Drosophila, Drosophila suzukii, has invaded Europe and America and has become a major agricultural pest in these areas, thereby prompting intense research activities to better understand its biology. Two draft genome assemblies already exist for this species but contain pervasive assembly errors and are highly fragmented, which limits their values. Our purpose here was to improve the assembly of the D. suzukii genome and to annotate it in a way that facilitates comparisons with D. melanogaster. For this, we generated PacBio longread sequencing data and assembled a novel, high-quality D. suzukii genome assembly. it is one of the largest Drosophila genomes, notably because of the expansion of its repeatome. We found that despite 16 rounds of full-sib crossings the D. suzukii strain that we sequenced has maintained high levels of polymorphism in some regions of its genome. As a consequence, the quality of the assembly of these regions was reduced. We explored possible origins of this high residual diversity, including the presence of structural variants and a possible heterogeneous admixture pattern of North American and Asian ancestry. Overall, our assembly and annotation constitute a high-quality genomic resource that can be used for both high-throughput sequencing approaches, as well as manipulative genetic technologies to study D. suzukii. Drosophila suzukii (Matsumura, 1931), the spotted wing Drosophila (Diptera: Drosophilidae), is an invasive fruit fly species originating from eastern Asia that has spread since 2008 in major parts of America and Europe. This species is still expanding its distribution 1,2 and is classified as a major pest on a variety of berries and stone fruit crops 3. Its behavior and phenotypic traits are now the subject of intense scrutiny both in the lab and in the field (reviewed in 4). Understanding the biology and the population dynamics of D. suzukii benefits from the production and mining of genomic and transcriptomic data, as well as manipulative genetic technologies including functional transgenesis and genome editing 5-7. Yet, the efficacy of these approaches relies critically on high-quality genomic resources. Currently, two D. suzukii genome assemblies, obtained from two different strains, have been generated based on short-read sequencing technologies 8,9. The utility of these valuable genomic resources is limited by the
Over the past decade, the spotted wing Drosophila, Drosophila suzukii, has invaded Europe and America and has become a major agricultural pest in these areas, thereby prompting intense research activities to better understand its biology. Two draft genome assemblies based on short-read sequencing were released in 2013 for this species. Although valuable, these resources contain pervasive assembly errors and are highly fragmented, two features limiting their values. Our purpose here was to improve the assembly of the D. suzukii genome. For this, we generated PacBio long-read sequencing data at 160X sequence coverage and assembled a novel, contiguous D. suzukii genome. We obtained a high-quality assembly of 270 Mb (with 546 contigs, a N50 of 2.6Mb, a L50 of 15, and a BUSCO score of 95%) that we called WT3-2.0. We found that despite 16 rounds of full-sib crossings the D. suzukii strain that we sequenced has maintained high levels of polymorphism in some regions of its genome (ca. 19Mb). As a consequence, the quality of the assembly of these regions was reduced. We explored possible origins of this high residual diversity, including the presence of structural variants and a possible heterogeneous admixture pattern of North American and Asian ancestry. Overall, our WT3-2.0 assembly provides a higher quality genomic resource compared to the previous one in terms of general assembly statistics, sequence quality and gene annotation. This new D. suzukii genome assembly is therefore an improved resource for high-throughput sequencing approaches, as well as manipulative genetic technologies to study D. suzukii.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.