2017 International Conference on Computer and Drone Applications (IConDA) 2017
DOI: 10.1109/iconda.2017.8270400
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of convolutionary neural networks modeling of DNA sequences using ordinal versus one-hot encoding method

Abstract: Abstract-Convolutionary neural network (CNN) is a popular choice for supervised DNA motif prediction due to its excellent performances. To employ CNN, the input DNA sequences are required to be encoded as numerical values and represented as either vectors or multi-dimensional matrices. This paper evaluates a simple and more compact ordinal encoding method versus the popular one-hot encoding for DNA sequences. We compare the performances of both encoding methods using three sets of datasets enriched with DNA mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
4
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 39 publications
(18 citation statements)
references
References 16 publications
0
18
0
Order By: Relevance
“…When processing the DNA sequence, it is necessary to convert the string sequence into a numerical value, so as to form a matrix input model training. Generally speaking, there are three methods for sequence encoding: sequential encoding, one-hot encoding, and k-mer encoding ( Choong and Lee, 2017 ). The characteristics of the three DNA encoding methods are shown in Table 1 .…”
Section: Basic Knowledge Of Dnamentioning
confidence: 99%
“…When processing the DNA sequence, it is necessary to convert the string sequence into a numerical value, so as to form a matrix input model training. Generally speaking, there are three methods for sequence encoding: sequential encoding, one-hot encoding, and k-mer encoding ( Choong and Lee, 2017 ). The characteristics of the three DNA encoding methods are shown in Table 1 .…”
Section: Basic Knowledge Of Dnamentioning
confidence: 99%
“…As the input_shape format of Conv3d required, the data dimensions were adjusted to suitable inputs using the transpose method. Finally, the input data is X, the label is Y, and the label Y is processed by one-hot encoding [25], which makes the feature calculation among features more reasonable and improves the computing speed. e calculation method is shown in Figure 4.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…Filters slide over rows of the matrix (words), performing convolutions on the one-hot vector and generating feature maps. Since all neurons in the feature map scan the same feature of the previous layer but from different locations, different feature maps detect different types of features (Choong and Lee, 2017).…”
Section: Convolutional Layermentioning
confidence: 99%