Jin Yang scite author profile

The Para rubber tree (Hevea brasiliensis) is an economically important tropical tree species that produces natural rubber, an essential industrial raw material. Here we present a high-quality genome assembly of this species (1.37 Gb, scaffold N50 = 1.28 Mb) that covers 93.8% of the genome (1.47 Gb) and harbours 43,792 predicted protein-coding genes. A striking expansion of the REF/SRPP (rubber elongation factor/small rubber particle protein) gene family and its divergence into several laticifer-specific isoforms seem crucial for rubber biosynthesis. The REF/SRPP family has isoforms with sizes similar to or larger than SRPP1 (204 amino acids) in 17 other plants examined, but no isoforms with similar sizes to REF1 (138 amino acids), the predominant molecular variant. A pivotal point in Hevea evolution was the emergence of REF1, which is located on the surface of large rubber particles that account for 93% of rubber in the latex (despite constituting only 6% of total rubber particles, large and small). The stringent control of ethylene synthesis under active ethylene signalling and response in laticifers resolves a longstanding mystery of ethylene stimulation in rubber production. Our study, which includes the re-sequencing of five other Hevea cultivars and extensive RNA-seq data, provides a valuable resource for functional genomics and tools for breeding elite Hevea cultivars.

show abstract

A comparison between ribo-minus RNA-sequencing and polyA-selected RNA-sequencing

Cui

et al. 2010

View full text Add to dashboard Cite

To compare the two RNA-sequencing protocols, ribo-minus RNA-sequencing (rmRNA-seq) and polyA-selected RNA-sequencing (mRNA-seq), we acquired transcriptomic data-52 and 32 million alignable reads of 35 bases in length-from the mouse cerebrum, respectively. We found that a higher proportion, 44% and 25%, of the uniquely alignable rmRNA-seq reads, is in intergenic and intronic regions, respectively, as compared to 23% and 15% from the mRNA-seq dataset. Further analysis made an additional discovery of transcripts of protein-coding genes (such as Histone, Heg1, and Dux), ncRNAs, snoRNAs, snRNAs, and novel ncRNAs as well as repeat elements in rmRNA-seq dataset. This result suggests that rmRNA-seq method should detect more polyA- or bimorphic transcripts. Finally, through comparative analyses of gene expression profiles among multiple datasets, we demonstrated that different RNA sample preparations may result in significant variations in gene expression profiles.

show abstract

PacBio full‐length cDNA sequencing integrated with RNA‐seq reads drastically improves the discovery of splicing transcripts in rice

et al. 2018

View full text Add to dashboard Cite

In eukaryotes, alternative splicing (AS) greatly expands the diversity of transcripts. However, it is challenging to accurately determine full-length splicing isoforms. Recently, more studies have taken advantage of Pacific Bioscience (PacBio) long-read sequencing to identify full-length transcripts. Nevertheless, the high error rate of PacBio reads seriously offsets the advantages of long reads, especially for accurately identifying splicing junctions. To best capitalize on the features of long reads, we used Illumina RNA-seq reads to improve PacBio circular consensus sequence (CCS) quality and to validate splicing patterns in the rice transcriptome. We evaluated the impact of CCS accuracy on the number and the validation rate of splicing isoforms, and integrated a comprehensive pipeline of splicing transcripts analysis by Iso-Seq and RNA-seq (STAIR) to identify the full-length multi-exon isoforms in rice seedling transcriptome (Oryza sativa L. ssp. japonica). STAIR discovered 11 733 full-length multi-exon isoforms, 6599 more than the SMRT Portal RS_Iso-Seq pipeline did. Of these splicing isoforms identified, 4453 (37.9%) were missed in assembled transcripts from RNA-seq reads, and 5204 (44.4%), including 268 multi-exon long non-coding RNAs (lncRNAs), were not reported in the MSU_osa1r7 annotation. Some randomly selected unreported splicing junctions were verified by polymerase chain reaction (PCR) amplification. In addition, we investigated alternative polyadenylation (APA) events in transcripts and identified 829 major polyadenylation [poly(A)] site clusters (PACs). The analysis of splicing isoforms and APA events will facilitate the annotation of the rice genome and studies on the expression and polyadenylation of AS genes in different developmental stages or growth conditions of rice.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jin Yang

The rubber tree genome reveals new insights into rubber production and species adaptation

A comparison between ribo-minus RNA-sequencing and polyA-selected RNA-sequencing

PacBio full‐length cDNA sequencing integrated with RNA‐seq reads drastically improves the discovery of splicing transcripts in rice

Contact Info

Product

Resources

About