Summary. Uniform confidence bands for densities f via non-parametric kernel estimates were first constructed by Bickel and Rosenblatt. In this paper this is extended to confidence bands in the deconvolution problem g D f * ψ for an ordinary smooth error density ψ. Under certain regularity conditions, we obtain asymptotic uniform confidence bands based on the asymptotic distribution of the maximal deviation (L 1 -distance) between a deconvolution kernel estimator f and f. Further consistency of the simple non-parametric bootstrap is proved. For our theoretical developments the bias is simply corrected by choosing an undersmoothing bandwidth. For practical purposes we propose a new data-driven bandwidth selector that is based on heuristic arguments, which aims at minimizing the L 1 -distance betweenf and f . Although not constructed explicitly to undersmooth the estimator, a simulation study reveals that the bandwidth selector suggested performs well in finite samples, in terms of both area and coverage probability of the resulting confidence bands. Finally the methodology is applied to measurements of the metallicity of local F and G dwarf stars. Our results confirm the 'G dwarf problem', i.e. the lack of metal poor G dwarfs relative to predictions from 'closed box models' of stellar formation.
We study the approximation of arbitrary distributions P on d-dimensional space by distributions with log-concave density. Approximation means minimizing a Kullback-Leibler-type functional. We show that such an approximation exists if and only if P has finite first moments and is not supported by some hyperplane. Furthermore we show that this approximation depends continuously on P with respect to Mallows distance D1(·, ·). This result implies consistency of the maximum likelihood estimator of a log-concave density under fairly general conditions. It also allows us to prove existence and consistency of estimators in regression models with a response Y = µ(X) + ε, where X and ε are independent, µ(·) belongs to a certain class of regression functions while ε is a random error with logconcave density and mean zero.
Summary.A general method for exploring multivariate data by comparing different estimates of multivariate scatter is presented. The method is based on the eigenvalue-eigenvector decomposition of one scatter matrix relative to another. In particular, it is shown that the eigenvectors can be used to generate an affine invariant co-ordinate system for the multivariate data. Consequently, we view this method as a method for invariant co-ordinate selection. By plotting the data with respect to this new invariant co-ordinate system, various data structures can be revealed. For example, under certain independent components models, it is shown that the invariant coordinates correspond to the independent components. Another example pertains to mixtures of elliptical distributions. In this case, it is shown that a subset of the invariant co-ordinates corresponds to Fisher's linear discriminant subspace, even though the class identifications of the data points are unknown. Some illustrative examples are given.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.