Abstract:In this paper, we present a theoretical foundation for a representation of a data set as a measure in a very large hierarchically parametrized family of positive measures, whose parameters can be computed explicitly (rather than estimated by optimization), and illustrate its applicability to a wide range of data types. The preprocessing step then consists of representing data sets as simple measures. The theoretical foundation consists of a dyadic product formula representation lemma, and a visualization theor… Show more
“…In [18] Kolaczyk and Nowak developed a systematic approach to multiscale probability models. They showed that multiscale factorizations, similar to Lemma 3.20 in [12] and the refomulation of it for dyadic measures on binary tree structured spaces, Lemma 2.1 in [2], arise when conditions for a "multiresolutionanalysis (MRA)" of likelihoods are satisfied and shown that these conditions characterize the Gaussian, Poisson and multinomial models. They also quantified the risk behavior of certain nonparametric, complexity penalized likelihood estimators based on their factorizations.…”
Section: Measures On Tree Structured Spacesmentioning
confidence: 97%
“…The product coefficient parameters uniquely determine the measure µ by the Dyadic Product Formula Representation ( Lemma 2.1) [2], even when the binary tree is infinite. The basic observation is that µ(S(n)) equals µ(D) multipled by the product of the factors from the root to a node n divided by 2 −scale(n) .…”
Section: Product Coefficients For Measures On Dyadic Setsmentioning
confidence: 99%
“…The two sets of product coefficients are related by a Bayes formula, so the new parameters are related by rational algebraic formulas to the original set of parameters. One quantitative formulation of this Bayes-type rule is given in Appendix 2 of version 2 of [2].…”
Section: Order-dependence Bayes Formulas and Invariant Measuresmentioning
confidence: 99%
“…The restricted set of dyadic measures that they used is very far from the case of general measures and in particular the L 2 condition they studied does not hold for multifractal measures which typically arise in for many real data sets. In [2] the author and collaborators realized the result could be re-formulated for dyadic measures on sets with binary tree structures (not just the unit interval) and used to provide an algorithmizable theoretically-based method for representing finite data samples from universes with a binary tree structure as vectors of product coefficient parameters of measures. They exploited the fact that the weak star limit result holds for the much more general class of non-negative dyadic measures.…”
Section: Measures On Tree Structured Spacesmentioning
confidence: 99%
“…In this paper we will describe a new method for representing a data set for which binary feature have been defined. The method algorithmically infers two related representations: a representation of the data set as a non-parametric multi-scale dyadic measure using the dyadic product formula representation [2] [12] [20] and a summary of the simplicial geometry of the support of the measure in terms of the betti numbers of a variant of a nerve simplicial complex determined by the dyadic measure. Both of these representations are based on mathematical theory.…”
“…In [18] Kolaczyk and Nowak developed a systematic approach to multiscale probability models. They showed that multiscale factorizations, similar to Lemma 3.20 in [12] and the refomulation of it for dyadic measures on binary tree structured spaces, Lemma 2.1 in [2], arise when conditions for a "multiresolutionanalysis (MRA)" of likelihoods are satisfied and shown that these conditions characterize the Gaussian, Poisson and multinomial models. They also quantified the risk behavior of certain nonparametric, complexity penalized likelihood estimators based on their factorizations.…”
Section: Measures On Tree Structured Spacesmentioning
confidence: 97%
“…The product coefficient parameters uniquely determine the measure µ by the Dyadic Product Formula Representation ( Lemma 2.1) [2], even when the binary tree is infinite. The basic observation is that µ(S(n)) equals µ(D) multipled by the product of the factors from the root to a node n divided by 2 −scale(n) .…”
Section: Product Coefficients For Measures On Dyadic Setsmentioning
confidence: 99%
“…The two sets of product coefficients are related by a Bayes formula, so the new parameters are related by rational algebraic formulas to the original set of parameters. One quantitative formulation of this Bayes-type rule is given in Appendix 2 of version 2 of [2].…”
Section: Order-dependence Bayes Formulas and Invariant Measuresmentioning
confidence: 99%
“…The restricted set of dyadic measures that they used is very far from the case of general measures and in particular the L 2 condition they studied does not hold for multifractal measures which typically arise in for many real data sets. In [2] the author and collaborators realized the result could be re-formulated for dyadic measures on sets with binary tree structures (not just the unit interval) and used to provide an algorithmizable theoretically-based method for representing finite data samples from universes with a binary tree structure as vectors of product coefficient parameters of measures. They exploited the fact that the weak star limit result holds for the much more general class of non-negative dyadic measures.…”
Section: Measures On Tree Structured Spacesmentioning
confidence: 99%
“…In this paper we will describe a new method for representing a data set for which binary feature have been defined. The method algorithmically infers two related representations: a representation of the data set as a non-parametric multi-scale dyadic measure using the dyadic product formula representation [2] [12] [20] and a summary of the simplicial geometry of the support of the measure in terms of the betti numbers of a variant of a nerve simplicial complex determined by the dyadic measure. Both of these representations are based on mathematical theory.…”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.