International Journal of Intelligent Computing and Cybernetics

2009

DOI: 10.1108/17563780910982707

|View full text |Cite

|

Sign up to set email alerts

|

Improved biclustering on expression data through overlapping control

¹

,

Federico Divina

²

,

Raúl Giráldez

³

et al.

Abstract: PurposeThe purpose of this paper is to present a novel control mechanism for avoiding overlapping among biclusters in expression data.Design/methodology/approachBiclustering is a technique used in analysis of microarray data. One of the most popular biclustering algorithms is introduced by Cheng and Church (2000) (Ch&Ch). Even if this heuristic is successful at finding interesting biclusters, it presents several drawbacks. The main shortcoming is that it introduces random values in the expression matrix to con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Introduction3

Results Using Synthetic Data2

Citation Types

Supporting

0

Mentioning

15

Contrasting

0

Year Published

2011

2011

2024

2024

Publication Types

Select...

Other4

Article2

Book1

Relationship

Self Cite0

Independent7

Authors

Journals

Cited by 14 publications

(15 citation statements)

References 17 publications

Supporting

0

Mentioning

15

Contrasting

0

Order By: Relevance

“…For this particular work, we have simulated data from 5 different time points and 10 conditions using microarrays containing 1000 genes. Each gene is assigned a random value which is contained in the rank, respectively for each condition, [1,15], [7,35] [10,30]. In such data set, we have allocated a tricluster with all its values fixed to 1.…”

Section: Results Using Synthetic Datamentioning

confidence: 99%

“…Traditional clustering algorithms work on the whole space of data dimensions examining each gene in the dataset under all conditions tested. Biclustering techniques [8] go a step further by relaxing the conditions and by allowing assessment only under a subset of the conditions of the experiment, and it has proved to be successful finding gene patterns [6,10]. However, clustering and biclustering are insufficient when analyzing data from microarray experiments where attention is payed on how the time affects gene's behavior.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Unravelling the Yeast Cell Cycle Using the TriGen Algorithm

Gutiérrez‐Avilés

¹

,

²

,

³

2011

Advances in Artificial Intelligence

View full text Add to dashboard Cite

Abstract. Analyzing microarray data represents a computational challenge due to the characteristics of these data. Clustering techniques are widely applied to create groups of genes that exhibit a similar behavior under the conditions tested. Biclustering emerges as an improvement of classical clustering since it relaxes the constraints for grouping allowing genes to be evaluated only under a subset of the conditions and not under all of them. However, this technique is not appropriate for the analysis of temporal microarray data in which the genes are evaluated under certain conditions at several time points. In this paper, we present the results of applying the TriGen algorithm, a genetic algorithm that finds triclusters that take into account the experimental conditions and the time points, to the yeast cell cycle problem, where the goal is to identify all genes whose expression levels are regulated by the cell cycle.

“…For this particular work, we have simulated data from 5 different time points and 10 conditions using microarrays containing 1000 genes. Each gene is assigned a random value which is contained in the rank, respectively for each condition, [1,15], [7,35] [10,30]. In such data set, we have allocated a tricluster with all its values fixed to 1.…”

Section: Results Using Synthetic Datamentioning

confidence: 99%

“…Traditional clustering algorithms work on the whole space of data dimensions examining each gene in the dataset under all conditions tested. Biclustering techniques [8] go a step further by relaxing the conditions and by allowing assessment only under a subset of the conditions of the experiment, and it has proved to be successful finding gene patterns [6,10]. However, clustering and biclustering are insufficient when analyzing data from microarray experiments where attention is payed on how the time affects gene's behavior.…”

Section: Introductionmentioning

confidence: 99%

Unravelling the Yeast Cell Cycle Using the TriGen Algorithm

Gutiérrez‐Avilés

¹

,

²

,

³

2011

Advances in Artificial Intelligence

View full text Add to dashboard Cite

Abstract. Analyzing microarray data represents a computational challenge due to the characteristics of these data. Clustering techniques are widely applied to create groups of genes that exhibit a similar behavior under the conditions tested. Biclustering emerges as an improvement of classical clustering since it relaxes the constraints for grouping allowing genes to be evaluated only under a subset of the conditions and not under all of them. However, this technique is not appropriate for the analysis of temporal microarray data in which the genes are evaluated under certain conditions at several time points. In this paper, we present the results of applying the TriGen algorithm, a genetic algorithm that finds triclusters that take into account the experimental conditions and the time points, to the yeast cell cycle problem, where the goal is to identify all genes whose expression levels are regulated by the cell cycle.

“…For this particular work, we have simulated data from 5 different time points and 10 conditions using microarrays containing 1000 genes. Each gene is assigned a random value which is contained in the rank, respectively for each condition, [1,15], [7,35] [10,30]. In such data set, we have allocated a tricluster with all its values fixed to 1.…”

Section: A Results Using Synthetic Datamentioning

confidence: 99%

“…Traditional clustering algorithms work on the whole space of data dimensions examining each gene in the dataset under all conditions tested. Biclustering techniques [5] go a step further by relaxing the conditions and by allowing assessment only under a subset of the conditions of the experiment, and it has proved to be successful finding gene patterns [6], [7]. However, clustering and biclustering …”

Section: Introductionmentioning

confidence: 99%

Revisiting the yeast cell cycle problem with the improved TriGen algorithm

Gutiérrez‐Avilés

¹

,

²

,

³

2011

2011 Third World Congress on Nature and Biologically Inspired Computing

View full text Add to dashboard Cite

No abstract

“…Traditional clustering algorithms work on the whole space of data dimensions examining each gene in the dataset under all conditions tested. Biclustering techniques [5] go a step further by relaxing the conditions and by allowing assessment only under a subset of the conditions of the experiment, and it has proved to be successful finding gene patterns [6], [7]. However, clustering and biclustering are insufficient when analyzing data from microarray experiments where attention is payed on how the time affects gene's behavior.…”

Section: Introductionmentioning

confidence: 99%

Triclustering on temporary microarray data using the TriGen algorithm

Gutiérrez‐Avilés

¹

,

²

,

³

2011

2011 11th International Conference on Intelligent Systems Design and Applications

View full text Add to dashboard Cite

and riquelme@us.es is important, for example, cell cycles, development at the molecular level or evolution of diseases [8]. Therefore is necessary to develop specific tools for data analysis in which genes are evaluated under certain conditions considering the time factor. In this context we present the TriGen algorithm, which goes a step further than clustering and biclustering techniques in the creation of groups of pattern similarity for genes. TriGen works on a three-dimensional space, thus taking into account the time factor, and allowing the evaluation of the behavior of genes only under certain conditions and only under certain time points. TriGen applies an evolutionary technique, genetic algorithms, to find solutions that we refer to as triclusters. Other works related with this approach are in [9] and [10]. The rest of the paper is structured as follows: Section II describes the algorithm in detail, Section III shows the results using both synthetic and real data. Section IV summarizes the conclusions reached and proposals for future work. II. METHODOLOGYWe describe the implementation of the TriGen algorithm. In this section we explain the inputs and outputs of the algorithm and we provide a detailed description of the evolutionary process and all the operators implied. A. Input dataThe input data is obtained from temporal microarray experiments. Each of these microarrays reveals the expression level under specific experimental conditions and at an instant of time. Therefore, the input data consists of T number of microarrays, as many as time points to be analyzed. Each value of a microarray for an specific time t represents the level of gene expression of a gene g under a specific experimental condition c. B. Definition of TriclusterWe define a tricluster as a subset of time points T , a subset of genes G and a subset of conditions C extracted from the input data. In this particular work, each tricluster contains the expression values of the these three sets and a fitness value that indicates the tricluster's quality. The fitness function will be described in detail in Section II-C6. Qualitatively, a tricluster will provide information on behavior pattern of a 877Abstract-The analysis of microarray data is a computational challenge due to the characteristics of these data. Clustering techniques are widely applied to create groups of genes that exhibit a similar behavior under the conditions tested. Biclustering emerges as an improvement of classical clustering since it relaxes the constraints for grouping allowing genes to be evaluated only under a subset of the conditions and not under all of them. However, this technique is not appropriate for the analysis of temporal microarray data in which the genes are evaluated under certain conditions at several time points. In this paper, we propose the TriGen algorithm, which finds triclusters that take into account the experimental conditions and the time points, using evolutionary computation, in particular genetic algorithms, enabling the evaluation of th...

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Product

Browser Extension Assistant by scite Citation Statement Search Reference Check Visualizations Dashboards Explore Journals Explore Organizations Explore Funders Embedding Badge Embedding Citation Search Pricing

Resources

Blog Help & FAQ Accessibility Statement API Terms For Universities & Governments For Researchers For Publishers For Corporate, Pharma & Enterprise Author Marketing Become an Affiliate Get an organization trial or quote scite Data & Services

About

News & Press Careers Read our Paper Coverage

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Copyright © 2024 scite LLC. All rights reserved.

Made with 💙 for researchers

Part of the Research Solutions Family.