2015
DOI: 10.1093/bib/bbv074

Discretization of gene expression data revised

Abstract: Gene expression measurements represent the most important source of biological data used to unveil the interaction and functionality of genes. In this regard, several data mining and machine learning algorithms have been proposed that require, in a number of cases, some kind of data discretization to perform the inference. Selection of an appropriate discretization process has a major impact on the design and outcome of the inference algorithms, as there are a number of relevant issues that need to be consider…

Cited by 57 publications (43 citation statements)
References 28 publications
“…While discretization always implies a loss of information, it also simplifies the state space of the problem considerably and it can deflate noisy data. Advantages and disadvantages of using discretized expression data are discussed in [32], for example.…”
Section: Methods (mentioning)
confidence: 99%
“…On the absence of such knowledge that is normally difficult to be objectively determined and assessed, automated and statistically driven approaches are followed. Discretization of gene expression values is already followed in many microarray studies and respective data analysis approaches [112], [113]. …”
Section: Methods (mentioning)
confidence: 99%
“…A necessary step for the formalization of data is to be able to express them in the terms of discrete activity levels [7]. The usual problem is to decide for a component, what is its threshold concentration, i.e.…”
Section: Measurements (mentioning)
confidence: 99%
“…Here, we rely on an assumption that a fold change of two or more is significant, which is, to the best of our knowledge, a common practice and in our case produces a good separation. However, it might be worthwhile to revisit these experiments and evaluate how different discretization methods [7] would compare.…”
Section: EGFR Model (mentioning)
confidence: 99%
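
The fold-change rule mentioned in the last statement can be illustrated with a minimal sketch. It assumes positive, non-log expression values and a symmetric threshold of two; the function name `discretize_fold_change` and the sample values are hypothetical and are not taken from the cited papers.

```python
def discretize_fold_change(treated, control, threshold=2.0):
    """Map a treated/control expression ratio to a discrete level.

    Returns 1 (up), -1 (down) or 0 (unchanged). The threshold of 2.0
    mirrors the "fold change of two or more" assumption; it is an
    illustrative default, not a recommended value.
    """
    if treated <= 0 or control <= 0:
        raise ValueError("expression values must be positive")
    ratio = treated / control
    if ratio >= threshold:
        return 1
    if ratio <= 1.0 / threshold:
        return -1
    return 0


# Hypothetical (treated, control) pairs for three genes.
samples = [(8.0, 2.0), (3.0, 2.5), (1.0, 4.0)]
levels = [discretize_fold_change(t, c) for t, c in samples]
print(levels)
```

Other discretization schemes, such as those compared in the reviewed article, would replace the fixed ratio test with a data-driven cut point.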