Experimental multi-scalar measurements in laboratory flames have provided important databases for the validation of turbulent combustion closure models. In this work, we present a framework for databased closure in turbulent combustion and establish an a priori validation of this framework. The approach is based on the construction of joint probability density functions (PDFs) and conditional statistics using experimental data based on the parameterization of the composition space using principal component analysis (PCA). The PCA on the data identifies key parameters, principal components (PCs), on which joint scalar PDFs and conditional scalar means can be constructed. To enable a generic implementation for the construction of joint scalar PDFs, we use the multidimensional kernel density estimation (KDE) approach. An a priori validation of the PCA-KDE approach is carried out using the Sandia piloted jet turbulent flames D, E and F. The analysis of the results suggests that a few PCs are adequate to reproduce the statistics, resulting in a low-dimensional representation of the joint scalars PDFs and the scalars' conditional means. A reconstruction of the scalars' means and RMS statistics are in agreement of the corresponding statistics extracted directly from the experimental data. We also propose one strategy to recover missing species and construct conditional means for the reaction rates based on a variation of the pairwise-mixing stirred reactor (PMSR) model. The model is demonstrated using numerical simulations based on the one-dimensional turbulence (ODT) model for the same flames. (T. Echekki). 3 computational cost for complex chemical systems and large computational problems where a large number of notional particles must be retained for an accurate evaluation of the PDFs. Alternative strategies to alleviate the potential cost of a joint PDF transport equation have been proposed, including strategies to parameterize the composition space (e.g. by adopting a flamelet assumption) or via the multi-environment PDF method [10].The two limits of a presumed PDF shape with a limited set of parameters and the solution for a PDF transport equation leave ample room in between for intermediate approaches, which may be based on the construction of PDFs based on simulation [11][12][13][14] or experimental data. For example, Goldin and Menon [11,12], Sankaran et al [13] and Calhoon et al [14] built joint scalar PDFs, which are parameterized by an appropriate set of lower moments using simulations based on the linear-eddy model (LEM) [15].The PDFs are generally trained on simpler canonical flows, such as scalar decay in homogeneous turbulence or co-flowing jet configurations. The LEM is a 1-D model that can accommodate any degree of chemical complexity based on the coupling of reaction and diffusion processes in a deterministic fashion and a stochastic implementation for turbulent stirring. As the model is implemented in physical space, it is capable of generating statistics from which a description of the composition sp...