2010
DOI: 10.4018/jkdb.2010040102
|View full text |Cite
|
Sign up to set email alerts
|

Clustering Genes Using Heterogeneous Data Sources

Abstract: Clustering of gene expression data is a standard exploratory technique used to identify closely related genes. Many other sources of data are also likely to be of great assistance in the analysis of gene expression data. Such sources include proteinprotein interaction data, transcription factor and regulatory elements data, comparative genomics data, protein expression data and much more. These data provide us with a means to begin elucidating the large-scale modular organization of the cell. Conclusions drawn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 30 publications
0
5
0
Order By: Relevance
“…Multimodal fusion for biological data is found in [3] for integration of gene expression data with text information and in [130] for metabolomics.…”
Section: Other Fields Of Application Of Tensor Fusionmentioning
confidence: 99%
See 1 more Smart Citation
“…Multimodal fusion for biological data is found in [3] for integration of gene expression data with text information and in [130] for metabolomics.…”
Section: Other Fields Of Application Of Tensor Fusionmentioning
confidence: 99%
“…Recent international efforts are marshalling the use of a bewildering array of different technologies to acquire high-throughput multimodal information about real-world systems. Examples of the systems and modalities probed are the Internet [1], geophysical data [2], and the human genome [3].…”
Section: Introductionmentioning
confidence: 99%
“…With the progress of powerful sensors and computer technologies, unprecedented massive data (multi‐dimensional data) are exponentially grown in many areas, such as Internet [1], geophysical data [2] and human genome [3] and so on. The processing technologies of these data have become a hot research topic, especially, dimensionality reduction (DR) is one of the most important technology [4].…”
Section: Introductionmentioning
confidence: 99%
“…While multi-similarity clustering applied to Unsupervised Object Discovery is a novelty of this work, other fields (e.g., bioinformatics) commonly use multi-similarity clustering to combine multiple heterogeneous data sources. Zeng et al (2010) combine gene expression data, text, and clustering constraints induced by the text data, to identify closely related genes. Zeng et al (2010) use a variant of EM in which parameter estimation and cluster reassignment are performed over a single data source picked at random at each iteration.…”
Section: Related Workmentioning
confidence: 99%
“…Zeng et al (2010) combine gene expression data, text, and clustering constraints induced by the text data, to identify closely related genes. Zeng et al (2010) use a variant of EM in which parameter estimation and cluster reassignment are performed over a single data source picked at random at each iteration. Troyanskaya et al (2003) introduce a Bayesian framework to cluster protein-protein interaction patterns based on multiple sources of protein relations.…”
Section: Related Workmentioning
confidence: 99%