2018
DOI: 10.1038/s41467-018-06910-x
|View full text |Cite
|
Sign up to set email alerts
|

Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon

Abstract: A significant portion of genes in vertebrate genomes belongs to multigene families, with each family containing several gene copies whose presence/absence, as well as isoform structure, can be highly variable across individuals. Existing de novo techniques for assaying the sequences of such highly-similar gene families fall short of reconstructing end-to-end transcripts with nucleotide-level precision or assigning alternatively spliced transcripts to their respective gene copies. We present IsoCon, a high-prec… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
57
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
3
2

Relationship

2
8

Authors

Journals

citations
Cited by 60 publications
(57 citation statements)
references
References 69 publications
0
57
0
Order By: Relevance
“…This is a particularly useful approach in species that lack a reference genome, but it comes with disadvantages. ICE-Quiver has been known to merge together transcripts from highly similar genes and can smooth over real differences of interest such as sequence variants and RNA editing events 30 . In addition, the algorithm is stochastic by nature, and cluster assignments for individual reads can vary substantially across different runs.…”
Section: Introductionmentioning
confidence: 99%
“…This is a particularly useful approach in species that lack a reference genome, but it comes with disadvantages. ICE-Quiver has been known to merge together transcripts from highly similar genes and can smooth over real differences of interest such as sequence variants and RNA editing events 30 . In addition, the algorithm is stochastic by nature, and cluster assignments for individual reads can vary substantially across different runs.…”
Section: Introductionmentioning
confidence: 99%
“…In this study we uncovered many principal features of ampliconic sequence and gene evolution, opening opportunities for new inquiries. Future investigations should focus on deciphering the sequences of different copies and isoforms of ampliconic genes (70) , which should allow one to examine natural selection operating at them in detail. Sequence amplification and increase in gene copy number (17) in orangutan should be examined further as well.…”
Section: Future Directionsmentioning
confidence: 99%
“…Long-read sequencing of transcripts with Pacific Biosciences (PacBio) Iso-Seq and Oxford Nanopore Technologies (ONT) has proven to be central to the study of complex isoform landscapes in, e.g., humans [1][2][3][4], animals [5], plants [6], fungi [7] and viruses [8]. Long reads can reconstruct more complex regions than can short RNA-seq reads because the often complex assembly step is not required.…”
Section: Introductionmentioning
confidence: 99%