2020
DOI: 10.7717/peerj-cs.307
|View full text |Cite
|
Sign up to set email alerts
|

A machine learning framework for the prediction of chromatin folding inDrosophilausing epigenetic features

Abstract: Technological advances have lead to the creation of large epigenetic datasets, including information about DNA binding proteins and DNA spatial structure. Hi-C experiments have revealed that chromosomes are subdivided into sets of self-interacting domains called Topologically Associating Domains (TADs). TADs are involved in the regulation of gene expression activity, but the mechanisms of their formation are not yet fully understood. Here, we focus on machine learning methods to characterize DNA folding patter… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2025
2025

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(18 citation statements)
references
References 55 publications
0
16
0
Order By: Relevance
“…Author details 1 School of Computer Science, Queensland University of Technology, Brisbane, QLD 4000, Australia. 2…”
Section: Authors' Contributionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Author details 1 School of Computer Science, Queensland University of Technology, Brisbane, QLD 4000, Australia. 2…”
Section: Authors' Contributionsmentioning
confidence: 99%
“…Recently, machine learning (ML) has become very widespread in research and has been incorporated in a variety of applications, including text mining, spam detection, video recommendation, image classification, and multimedia concept retrieval [1][2][3][4][5][6]. Among the different ML algorithms, deep learning (DL) is very commonly employed in these applications [7][8][9].…”
Section: Introductionmentioning
confidence: 99%
“…We compared TAPIOCA-network's performance on the task of TAD prediction to the performance of previously used models such as linear regression (Ulianov et al 2016), ridge and lasso regression (Ramírez et al 2018) and Bi-directional Long Short-Term Memory (BILSTM) (Rozenwald et al 2020) We observe visual similarity between the predictions of our network and Hi-C derived labels across all three metrics (Fig 4). TAPIOCA-network outperforms all previous approaches on the transitional gamma dataset (Table 1).…”
Section: Benchmark Of Tapioca Network Relative To Prior Artmentioning
confidence: 90%
“…Transitional gamma is computed by performing TAD calling using the armatus tool with gamma values 1-10 and assigning the transitional gamma of a loci to be the first gamma value at which armatus identifies a TAD boundary (Figure 1a). In our experiments we use the transitional gamma values assigned to the feature dataset by (Rozenwald et al 2020).…”
Section: Overview Of Dataset Features and Labelsmentioning
confidence: 99%
See 1 more Smart Citation