2017
DOI: 10.1038/celldisc.2017.31

Long read reference genome-free reconstruction of a full-length transcriptome from Astragalus membranaceus reveals transcript variants involved in bioactive compound biosynthesis

Abstract: Astragalus membranaceus, also known as Huangqi in China, is one of the most widely used medicinal herbs in Traditional Chinese Medicine. Traditional Chinese Medicine formulations from Astragalus membranaceus have been used to treat a wide range of illnesses, such as cardiovascular disease, type 2 diabetes, nephritis and cancers. Pharmacological studies have shown that immunomodulating, anti-hyperglycemic, anti-inflammatory, antioxidant and antiviral activities exist in the extract of Astragalus membranaceus. T…

Cited by 105 publications (89 citation statements)
References 37 publications
“…The average length of consensus sequences ranged from 925 bp to 1,438 kb (Figure S1). The number of consensus transcripts significantly exceeded the expected number of expressed genes, however, this is consistent with other reference genome-free IsoSeq analyses (Li et al, 2017; Kuang et al, 2019; Yan et al, 2019). Inflated numbers of consensus transcripts can result from sequencing of multiple alternatively spliced isoforms of the same gene, sequencing of incompletely processed mRNA molecules (Martin et al, 2014), high sequence error rates preventing multiple sequences from the same transcript being collapsed into a consensus, divergent haplotypes of the same locus present in our clonally propagated, wild collected, or partially inbred starting material, or contamination of the original samples with mRNA from non-target organisms.…”
Section: Results
supporting
confidence: 89%
“…While H. amplexicaulis exhibited the shortest consensus transcript length, this was not reflected in a reduced number of complete ORFs, those containing both an in-frame ATG and stop codon and occupying at least 60% of the total transcript length. The number of consensus transcripts significantly exceeded the expected number of expressed genes; however, this is consistent with other reference genome-free IsoSeq analyses (Kuang, Sun, Wei, Li, & Sun, 2019; Li et al, 2017; Yan et al, 2019). Inflated numbers of consensus transcripts can result from sequencing of multiple alternatively spliced isoforms of the same gene, sequencing of incompletely processed mRNA molecules (Martin et al, 2014), high sequence error rates preventing multiple sequences from the same transcript being collapsed into a consensus, divergent haplotypes of the same locus present in our clonally propagated, wild collected, or partially inbred starting material, or contamination of the original samples with mRNA from non-target organisms.…”
supporting
confidence: 89%
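
The "complete ORF" criterion quoted above (an in-frame ATG and stop codon, with the ORF occupying at least 60% of the transcript) can be illustrated with a minimal Python sketch. This is not the citing authors' pipeline: the function names, the forward-strand-only scan, the standard stop codons, and the choice to count the stop codon toward ORF length are all assumptions made for illustration.

```python
# Hypothetical sketch of the "complete ORF" filter described in the quote.
# Assumptions (not from the source): forward strand only, standard stop
# codons, and the stop codon is counted as part of the ORF length.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_complete_orf(seq):
    """Return (start, end) of the longest ATG-to-stop ORF, or None if absent."""
    seq = seq.upper()
    best = None
    for frame in range(3):
        start = None  # index of the first ATG seen since the last stop codon
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                end = i + 3  # end index is exclusive and includes the stop
                if best is None or end - start > best[1] - best[0]:
                    best = (start, end)
                start = None
    return best


def has_complete_orf(seq, min_fraction=0.6):
    """True if the longest complete ORF covers at least min_fraction of seq."""
    orf = longest_complete_orf(seq)
    if orf is None:
        return False
    return (orf[1] - orf[0]) / len(seq) >= min_fraction
```

For example, `has_complete_orf("ATG" + "GCT" * 10 + "TAA")` passes the filter because the ORF spans the whole sequence, whereas a transcript with an ATG but no downstream in-frame stop codon does not.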
“…Moreover, small laboratories face high sequencing costs due to the need for long reads and high-depth short read sequences to be accurate in de novo assembly. Plants with large genomes pose even more difficulty, as in, for example, the common soybean crop, which has a genome size of ∼1.1 Gb [21]. To improve the comprehensive accuracy of gene prediction, there is a need to introduce a new approach, the “Isoform sequencing (Iso-Seq).” Thanks to its long-read technology, Iso-Seq facilitates identifying new isoforms with a high level of accuracy [22].…”
Section: Introduction
mentioning
confidence: 99%