2020
DOI: 10.1089/cmb.2019.0286
|View full text |Cite
|
Sign up to set email alerts
|

More Accurate Transcript Assembly via Parameter Advising

Abstract: Computational tools used for genomic analyses are becoming more accurate but also increasingly sophisticated and complex. This introduces a new problem in that these pieces of software have a large number of tunable parameters that often have a large influence on the results that are reported. We quantify the impact of parameter choice on transcript assembly and take some first steps toward generating a truly automated genomic analysis pipeline by developing a method for automatically choosing input-specific p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
23
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(23 citation statements)
references
References 25 publications
0
23
0
Order By: Relevance
“…The heuristic constructions used in this framework, such as the way to define the similarity between two functions, the form of the contrastive loss function, and the architecture of the encoding network can be further improved in the future. Additionally, following DeBlasio et al [2020], we use AUC to estimate the performance of transcript assemblers by comparing predicted transcripts with known ones. However, this is not a perfect measure as it may unfairly penalize assemblers that generate novel transcripts, which can be of significant interest.…”
Section: Discussionmentioning
confidence: 99%
“…The heuristic constructions used in this framework, such as the way to define the similarity between two functions, the form of the contrastive loss function, and the architecture of the encoding network can be further improved in the future. Additionally, following DeBlasio et al [2020], we use AUC to estimate the performance of transcript assemblers by comparing predicted transcripts with known ones. However, this is not a perfect measure as it may unfairly penalize assemblers that generate novel transcripts, which can be of significant interest.…”
Section: Discussionmentioning
confidence: 99%
“…Then, the hypothesis test probability ( p -value) was calculated by statistical model. Third, multiple hypothesis testing correction (Benjiamini and Hochberg method) ( 14 ) was performed to obtain the corrected p -value (false discovery rate, FDR). The DESeq2 package in R was used to screen genes with significant differences between different samples ( 15 ).…”
Section: Methodsmentioning
confidence: 99%
“…While the transcript assembly problem has attracted great interest from the community, with a proliferation of methods proposed [10], [11], [15], [16], [17], [18], [19], [20], [23], [56], [57], [58], [59], [60], [61], assembling RNA-seq reads remains a challenge, with RNA assembly methods having a precision under 50%-60% on human data [18], [62]. In addition, current algorithms that aim to produce fulllength transcripts employ various heuristics and thresholds to increase contiguity of the transcript under assembly, which makes results vary significantly in quality with different parameter settings [63].…”
Section: Application To Rna Transcript Assemblymentioning
confidence: 99%