2019
DOI: 10.1109/access.2018.2890096
|View full text |Cite
|
Sign up to set email alerts
|

An Encoding Scheme Capturing Generic Priors and Properties of Amino Acids Improves Protein Classification

Abstract: Feature engineering aims at representing non-numeric data with numeric features that keep the essential information of the underlying problem, and it is a non-trivial process in building a predictive model. In bioinformatics, there is a profound scale of DNA and protein sequences available, but far from being fully utilized. Computational models can facilitate the analyses of large-scale data. However, most computational models require a numeric representation as input. Expert knowledge can help design feature… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 34 publications
0
2
0
Order By: Relevance
“…Section 4.2.2 presents how CFreeEnS works in detail, while Section 4.2.3 provides a general method for evaluating a computational model. Work of this section is based on publications Zhou et al (2018b) and Zhou et al (2018c). .1 shows a typical pipeline of computational modeling can be divided into four modules: data retrieval, feature engineering, modeling, and evaluation.…”
Section: Cfreeens: a Context-free Encoding Scheme Of Protein Sequencesmentioning
confidence: 99%
See 1 more Smart Citation
“…Section 4.2.2 presents how CFreeEnS works in detail, while Section 4.2.3 provides a general method for evaluating a computational model. Work of this section is based on publications Zhou et al (2018b) and Zhou et al (2018c). .1 shows a typical pipeline of computational modeling can be divided into four modules: data retrieval, feature engineering, modeling, and evaluation.…”
Section: Cfreeens: a Context-free Encoding Scheme Of Protein Sequencesmentioning
confidence: 99%
“…The encoding scheme has been applied to different tasks of protein classification, as well as measuring the phenotype similarity between proteins, resulting in better performance than other traditional schemes. Details about the application of CFreeEnS on protein classification are presented in Appendix A (Zhou et al, 2018c).…”
Section: Cfreeens For Protein Sequences and Protein Sequence Pairsmentioning
confidence: 99%