The common carp, Cyprinus carpio, is one of the most important cyprinid species and globally accounts for 10% of freshwater aquaculture production. Here we present a draft genome of domesticated C. carpio (strain Songpu), whose current assembly contains 52,610 protein-coding genes and approximately 92.3% coverage of its paleotetraploidized genome (2n = 100). The latest round of whole-genome duplication has been estimated to have occurred approximately 8.2 million years ago. Genome resequencing of 33 representative individuals from worldwide populations demonstrates a single origin for C. carpio in two subspecies (C. carpio haematopterus and C. carpio carpio). Integrative genomic and transcriptomic analyses were used to identify loci potentially associated with traits including scaling patterns and skin color. In combination with the high-resolution genetic map, the draft genome paves the way for better molecular studies and improved genome-assisted breeding of C. carpio and other closely related species.
Background: Generation of large mate-pair libraries is necessary for de novo genome assembly but the procedure is complex and time-consuming. Furthermore, in some complex genomes, it is hard to increase the N50 length even with large mate-pair libraries, which leads to low transcript coverage. Thus, it is necessary to develop other simple scaffolding approaches, to at least solve the elongation of transcribed fragments. Results: We describe L_RNA_scaffolder, a novel genome scaffolding method that uses long transcriptome reads to order, orient and combine genomic fragments into larger sequences. To demonstrate the accuracy of the method, the zebrafish genome was scaffolded. With expanded human transcriptome data, the N50 of the human genome was doubled and L_RNA_scaffolder out-performed most scaffolding results by existing scaffolders which employ mate-pair libraries. In these two examples, the transcript coverage was almost complete, especially for long transcripts. We applied L_RNA_scaffolder to the highly polymorphic pearl oyster draft genome and the gene model length significantly increased. Conclusions: The simplicity and high throughput of RNA-seq data make this approach suitable for genome scaffolding. L_RNA_scaffolder is available at http://www.fishbrowser.org/software/L_RNA_scaffolder.
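The abstract above reports scaffolding gains in terms of N50 without defining it. As a minimal sketch, assuming the standard definition (the length of the shortest sequence among the longest sequences that together cover at least half of the total assembly length), N50 could be computed as follows. The function name `n50` and the toy scaffold lengths are hypothetical illustrations, not values or code from L_RNA_scaffolder.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumes the standard definition: walk the sequences from longest
    to shortest and return the length at which the running total
    first reaches half of the summed length.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy data: five fragments before scaffolding, and the
# same total length after three of them are joined into one scaffold.
before = [100, 80, 60, 40, 20]
after = [180, 100, 20]

print(n50(before))
print(n50(after))
```

Joining fragments leaves the total assembly length unchanged but moves more of it into longer sequences, which is why scaffolding tends to raise N50.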
Whole genome duplication (WGD) results in extensive genetic redundancy. In plants and yeast, WGD is followed by rapid gene deletions and intense expression differentiation with slow functional divergence. However, the early evolution of the gene differentiation processes is poorly understood in vertebrates because almost all studied WGDs are extremely ancient, and the genomes have returned to a diploid status. Common carp had a very recent fourth round of WGD dated to 8 million years ago. It therefore constitutes an ideal model to study early-stage functional divergence and expression differentiation in vertebrates. We identified 1,757 pairs of recently duplicated genes (RDGs) originating from this specific WGD and found that most ancestral genes were retained in duplicate. Most RDGs were conserved and under selective pressure. Gene expression analysis across six tissues revealed that 92.5% of RDG pairs were co-expressed in at least one tissue and that the expression of nearly half of the pairs ceased to be strongly correlated, indicating slow spatial divergence but rapid expression dissociation. Functional comparison revealed that 25% of pairs had functional divergence, of which neo- and sub-functionalization were the main outcomes. Our analysis revealed slow gene loss but rapid and intense expression and function differentiation after WGD.
For nearly 50 years, the vision of using single molecules in circuits has been seen as providing the ultimate miniaturization of electronic chips. An advanced example of such a molecular electronics chip is presented here, with the important distinction that the molecular circuit elements play the role of general-purpose single-molecule sensors. The device consists of a semiconductor chip with a scalable array architecture. Each array element contains a synthetic molecular wire assembled to span nanoelectrodes in a current monitoring circuit. A central conjugation site is used to attach a single probe molecule that defines the target of the sensor. The chip digitizes the resulting picoamp-scale current-versus-time readout from each sensor element of the array at a rate of 1,000 frames per second. This provides detailed electrical signatures of the single-molecule interactions between the probe and targets present in a solution-phase test sample. This platform is used to measure the interaction kinetics of single molecules, without the use of labels, in a massively parallel fashion. To demonstrate broad applicability, examples are shown for probe molecule binding, including DNA oligos, aptamers, antibodies, and antigens, and the activity of enzymes relevant to diagnostics and sequencing, including a CRISPR/Cas enzyme binding a target DNA, and a DNA polymerase enzyme incorporating nucleotides as it copies a DNA template. All of these applications are accomplished with high sensitivity and resolution, on a manufacturable, scalable, all-electronic semiconductor chip device, thereby bringing the power of modern chips to these diverse areas of biosensing.
Intermediate-risk acute myeloid leukemia (IR-AML), which accounts for a substantial number of AML cases, is highly heterogeneous. Although several mutations have been identified, the heterogeneity of AML is uncertain because novel mutations have yet to be discovered. Here we applied a next-generation sequencing (NGS) platform to screen mutational hotspots in 410 genes relevant to hematological malignancy. IR-AML samples (N=95) were sequenced by Illumina HiSeq and mutations in 101 genes were identified. Only seven genes (CEBPA, NPM1, DNMT3A, FLT3-ITD, NRAS, IDH2 and WT1) were mutated in more than 10% of patients. Genetic interaction analysis identified several cooperative and exclusive patterns of overlapping mutations. Mutational analysis indicated some correlation between genotype and phenotype. FLT3-ITD mutations were identified as independent factors of poor prognosis, while CEBPA mutations were independent favorable factors. Co-occurrence of FLT3-ITD, NPM1 and DNMT3A mutations was associated with specific clinical AML features and poor outcomes. Furthermore, by integrating multiple mutations in the survival analysis, 95 IR-AML patients could be stratified into three distinct risk groups allowing reductions in IR-AML by one-third. Our study offers deep insights into the molecular pathogenesis and biology of AML and indicates that the prognosis of IR-AML could be further stratified by different mutation combinations which may direct future treatment intervention.