We introduce a novel two-step approach for estimating a probability density function (pdf) given its samples, with the second and important step coming from a geometric formulation. The procedure involves obtaining an initial estimate of the pdf and then transforming it via a warping function to reach the final estimate. The initial estimate is intended to be computationally fast, albeit suboptimal, but its warping creates a larger, flexible class of density functions, resulting in substantially improved estimation. The search for optimal warping is accomplished by mapping diffeomorphic functions to the tangent space of a Hilbert sphere, a vector space whose elements can be expressed using an orthogonal basis. Using a truncated basis expansion, we estimate the optimal warping under a (penalized) likelihood criterion and, thus, the optimal density estimate. This framework is introduced for univariate, unconditional pdf estimation and then extended to conditional pdf estimation. The approach avoids many of the computational pitfalls associated with classical conditional-density estimation methods, without losing on estimation performance. We derive asymptotic convergence rates of the density estimator and demonstrate this approach using both synthetic datasets and real data, the latter relating to the association of a toxic metabolite on preterm birth.
Key words and phrases: conditional density; density estimation; warped density; Hilbert sphere; sieve estimation; tangent space; weighted likelihood maximization S Saoudi, A Hillion, and F Ghorbel. Non-parametric probability density function estimation on a bounded support: Applications to shape classification and speech coding. Applied Stochastic models and data analysis, 10(3):215-231, 1994. S Saoudi, F Ghorbel, and A Hillion. Some statistical properties of the kernel-diffeomorphism estimator. Applied stochastic models and data analysis, 13(1):39-58, 1997. Simon J Sheather and Michael C Jones. A reliable data-based bandwidth selection method for kernel density estimation. Journal of the Royal Statistical Society. Series B (Methodological), pages 683-690, 1991. Anuj Srivastava and Eric P Klassen. Functional and shape data analysis. Springer, 2016. EG Tabak and Cristina V Turner. A family of nonparametric density estimation algorithms. Communications on Pure and Applied