Abstract:International audience
For a given matrix of size $n \times m$ over a finite alphabet $\mathcal{A}$, a bicluster is a submatrix composed of selected columns and rows satisfying a certain property. In microarrays analysis one searches for largest biclusters in which selected rows constitute the same string (pattern); in another formulation of the problem one tries to find a maximally dense submatrix. In a conceptually similar problem, namely the bipartite clique problem on graphs, one looks for the la… Show more
“…Analyzing computational data, Lonardi, Szpankowski, and Yang [25,26] conjectured the shape of the 1-rectangles. The conjecture was proven by Park and Szpankowski [29]. Their proof can be formulated as follows: Let f : X × Y → {0, 1} be a random Boolean function with parameter p.…”
Section: Largest 1-rectanglementioning
confidence: 97%
“…The size of the largest monochromatic rectangle in a random Bernoulli matrix was determined in [29] when p is bounded away from 0 and 1, but their technique fails for p → 1.…”
Section: Relationship To Related Workmentioning
confidence: 99%
“…The size of the largest monochromatic rectangle is of interest in the analysis of gene expression data [29], and formal concept analysis [6].…”
Section: Relationship To Related Workmentioning
confidence: 99%
“…Proving that no larger ones exist requires some work. The problem with the union-bound based proof in [29] is that it breaks down if p tends to 1 moderately quickly. In our proofs, we work with strong tail bounds instead.…”
Section: Largest 1-rectanglementioning
confidence: 99%
“…Sizes of square 1-rectangles have been studied, too. Building on work in [7,6,29], it was settled in [32], for constant p. We need results for p → 0, 1, but, fortunately, for our theorem, we only require weak upper bounds.…”
We study nondeterministic communication complexity and related concepts (fooling sets, fractional covering number) of random functions f : X × Y → {0, 1} where each value is chosen to be 1 independently with probability p = p(n), n := |X| = |Y |.
“…Analyzing computational data, Lonardi, Szpankowski, and Yang [25,26] conjectured the shape of the 1-rectangles. The conjecture was proven by Park and Szpankowski [29]. Their proof can be formulated as follows: Let f : X × Y → {0, 1} be a random Boolean function with parameter p.…”
Section: Largest 1-rectanglementioning
confidence: 97%
“…The size of the largest monochromatic rectangle in a random Bernoulli matrix was determined in [29] when p is bounded away from 0 and 1, but their technique fails for p → 1.…”
Section: Relationship To Related Workmentioning
confidence: 99%
“…The size of the largest monochromatic rectangle is of interest in the analysis of gene expression data [29], and formal concept analysis [6].…”
Section: Relationship To Related Workmentioning
confidence: 99%
“…Proving that no larger ones exist requires some work. The problem with the union-bound based proof in [29] is that it breaks down if p tends to 1 moderately quickly. In our proofs, we work with strong tail bounds instead.…”
Section: Largest 1-rectanglementioning
confidence: 99%
“…Sizes of square 1-rectangles have been studied, too. Building on work in [7,6,29], it was settled in [32], for constant p. We need results for p → 0, 1, but, fortunately, for our theorem, we only require weak upper bounds.…”
We study nondeterministic communication complexity and related concepts (fooling sets, fractional covering number) of random functions f : X × Y → {0, 1} where each value is chosen to be 1 independently with probability p = p(n), n := |X| = |Y |.
Abstract-Microarray data processing revolves around the pivotal issue of locating genes altering their expression in response to pathogens, other organisms or other multiple environmental conditions resulted out of a comparison between infected and uninfected cells or tissues. To have a comprehensive analysis of the corollaries of certain treatments, deseases and developmental stages embodied as a data matrix on gene expression data is possible through simultaneous observation and monitoring of the expression levels of multiple genes. Clustering is the mechanism of grouping genes into clusters based on different parameters. Clustering is the process of grouping genes into clusters either considering row at a time(row clustering) or considering column at a time(column clustering). The application of clustering approach is crippled by conditions which are unrelated to genes. To get better of these problems a unique form of clustering technique has evolved which offers simultaneous clustering (both rows and columns) which is known as biclustering. A bicluster is deemed to be a sub matrix consisting data values. A bicluster is resulted out of the removal of some of the rows as well as some of the columns of given data matrix in such a fashion that each row of what is left reads the same string. A fast, simple and efficient randomized algorithm is explored in this paper, which discovers the largest bicluster by random projections.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.