2006
DOI: 10.1093/bib/bbk007
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning in bioinformatics

Abstract: This article reviews machine learning methods for bioinformatics. It presents modelling methods, such as supervised classification, clustering and probabilistic graphical models for knowledge discovery, as well as deterministic and stochastic heuristics for optimization. Applications in genomics, proteomics, systems biology, evolution and text mining are also shown.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
433
0
9

Year Published

2008
2008
2017
2017

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 743 publications
(443 citation statements)
references
References 228 publications
1
433
0
9
Order By: Relevance
“…Supervised methods utilize a prior knowledge about the system by developing classification models based on known spectra [62,[64][65][66]. These methods include Linear DA (LDA), Direct LDA (DLDA), Heteroscedastic LDA (HLDA), Nonparametric DA (NDA), Kernel-based LDA (K-LDA), SIMCA, PLS-DA and Multivariate Analysis of Variance (MANOVA) [67][68][69][70][71][72][73][74].…”
Section: Identification Of Unknown Body Fluidsmentioning
confidence: 99%
“…Supervised methods utilize a prior knowledge about the system by developing classification models based on known spectra [62,[64][65][66]. These methods include Linear DA (LDA), Direct LDA (DLDA), Heteroscedastic LDA (HLDA), Nonparametric DA (NDA), Kernel-based LDA (K-LDA), SIMCA, PLS-DA and Multivariate Analysis of Variance (MANOVA) [67][68][69][70][71][72][73][74].…”
Section: Identification Of Unknown Body Fluidsmentioning
confidence: 99%
“…Thus, the sum of the probabilities of all the transitions from a given states to all other states must be 1. Markov and HMMs are gaining popularity in bioinformatics research for nucleotide sequence analysis [10,12]. For prokaryotes gene identification, Borodovsky [15] effectively applied this HMM technique.…”
Section: Literature Surveymentioning
confidence: 99%
“…In this context, Biological data mining plays an important role to supply data to overcome provoking challenges in the process of research and development, thus enabling various possibilities in this direction [1]. Although the field of bioinformatics is originally aimed to extract information embedded within the three billion bases of human DNA, the field has further evolved to understand its capability and capacity for studying information contents and information flow of biological systems and processes.…”
Section: Introductionmentioning
confidence: 99%
“…The conditional probabilities learnt through the phenotype statistical distribution in the database will be used to report interactions among genes not only based on their individual expression levels, but also on their behaviour through the different conditions. This fact involves the addition of the probabilistic relationship that associates the sample class (or phenotype) with each relevant gene or feature under the study, that is, a supervised-class experimental design [18,19]. Our new proposal belongs to these supervised studies, stressing the search of robust results by means of a hierarchy of supervised Bayesian classifiers.…”
Section: Introductionmentioning
confidence: 99%