2020
DOI: 10.1101/2020.07.08.194308
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Convolution Based Computational Approach Towards DNA N6-methyladenine Site Identification and Motif Extraction in Rice Genome

Abstract: ABSTRACTDNA N6-methylation (6mA) in Adenine nucleotide is a post replication modification and is responsible for many biological functions. Experimental methods for genome wide 6mA site detection is an expensive and manual labour intensive process. Automated and accurate computational methods can help to identify 6mA sites in long genomes saving significant time and money. Our study develops a convolutional neural network based tool i6mA-CNN capable of identifying 6mA sites in … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 51 publications
(129 reference statements)
0
1
0
Order By: Relevance
“…I6mA-Fuse 24 constructs five Random Forest models using five separate feature encoding methods: one-hot, di-nucleotide binary, k-space spectral nucleus, k-mer, and EIIPs, and uses linear regression models to combine the prediction probability scores of five RF models based on a single encoding to predict the m6A sites of F. vesca and R. chinensis. I6mA-CNN 25 uses one-hot, dinucleotide binary, dinucleotide property and a hybrid encoding method to encode the sequence, and then conducts convolution and full join operations to obtain four models and perform fusion to predict methylation sites. Deep6mA 26 encodes the sequence as one pot, extracts, and abstracts the features of the sequence using a 5-layer convolutional neural network, and then uses LSTM to model the extracted abstract features in terms of time series, extracts the timing effect between high-level features, and then makes a full connection prediction.…”
Section: Introductionmentioning
confidence: 99%
“…I6mA-Fuse 24 constructs five Random Forest models using five separate feature encoding methods: one-hot, di-nucleotide binary, k-space spectral nucleus, k-mer, and EIIPs, and uses linear regression models to combine the prediction probability scores of five RF models based on a single encoding to predict the m6A sites of F. vesca and R. chinensis. I6mA-CNN 25 uses one-hot, dinucleotide binary, dinucleotide property and a hybrid encoding method to encode the sequence, and then conducts convolution and full join operations to obtain four models and perform fusion to predict methylation sites. Deep6mA 26 encodes the sequence as one pot, extracts, and abstracts the features of the sequence using a 5-layer convolutional neural network, and then uses LSTM to model the extracted abstract features in terms of time series, extracts the timing effect between high-level features, and then makes a full connection prediction.…”
Section: Introductionmentioning
confidence: 99%