2010
DOI: 10.1007/978-3-642-16001-1_20
|View full text |Cite
|
Sign up to set email alerts
|

Biologically-aware Latent Dirichlet Allocation (BaLDA) for the Classification of Expression Microarray

Abstract: Topic models have recently shown to be really useful tools for the analysis of microarray experiments. In particular they have been successfully applied to gene clustering and, very recently, also to samples classification. In this latter case, nevertheless, the basic assumption of functional independence between genes is limiting, since many other a priori information about genes' interactions may be available (co-regulation, spatial proximity or other a priori knowledge). In this paper a novel topic model is… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
21
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 21 publications
(21 citation statements)
references
References 20 publications
0
21
0
Order By: Relevance
“…One clear example is represented by the topic models [14,5], initially designed for text mining and computer vision applications, and recently successfully applied in the microarray context [23,3,2]. Clearly, the peculiar context may lead to substantial changes in the model so to improve results [21].…”
Section: Introductionmentioning
confidence: 99%
“…One clear example is represented by the topic models [14,5], initially designed for text mining and computer vision applications, and recently successfully applied in the microarray context [23,3,2]. Clearly, the peculiar context may lead to substantial changes in the model so to improve results [21].…”
Section: Introductionmentioning
confidence: 99%
“…PLSA may be very useful in the expression microarray context, since it may provide powerful and interpretable descriptions of experiments [3,22,24]. In particular there is an analogy between the pairs word-document and gene-sample: actually it is reasonable to intend the samples as documents and the genes as words.…”
Section: Probabilistic Latent Semantic Analysismentioning
confidence: 99%
“…[17,29]. Among others, in recent years some promising techniques were based on a particular class of probabilistic approaches, called topic models, showing optimal and highly interpretable results [2,22,24]. Such probabilistic topic models, the two most famous examples being the Probabilistic Latent Semantic Analysis (PLSA [15]) and the Latent Dirichlet Allocation (LDA [5]), have been imported from the text analysis realm as workhorses in several scientific fields [6,8,33].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, we consider learning models for count data, a prominent type of data that arises in bag-of-words representations of text documents, in bioinformatics for example as counts of active genes over pathways, and in other domains. Latent structure in count data has often been modeled with topic models [1], in domains from document collections [2] to bioinformatics [3,4].…”
Section: Introductionmentioning
confidence: 99%