High throughput nonparametric probability density estimation

Farmer, Jenny; Jacobs, Donald J.

doi:10.1371/journal.pone.0196937

Cited by 19 publications

(23 citation statements)

References 55 publications


“…As defined in Table 1, the proposed scoring functions include the relevant part of the Anderson-Darling (AD) measure [15], denoted as S_AD, and the quasi log-likelihood formula [11], denoted as S_LL. Note that…”

Section: Sample Size Invariant Scoring Functions (mentioning)

confidence: 99%

“…A critical part of the algorithm in the PDFestimator [11] is that the input data sample is partitioned into hierarchical sub-samples by powers of 2 when N > 1025. Consequently, the employed scoring function should be sample size invariant for all partitions.…”

Section: Partition Size Invariance (mentioning)

confidence: 99%
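
The statement above does not spell out how the sub-samples are formed. As a minimal sketch only, the following assumes a hypothetical scheme in which nested sub-samples are taken from the sorted data with strides that are powers of 2, stopping once a sub-sample has at most 1025 points. The function name, the stride-based nesting, and the `threshold` parameter are illustrative assumptions, not the PDFestimator's actual procedure.

```python
def hierarchical_subsamples(sorted_sample, threshold=1025):
    """Return nested sub-samples of a sorted sample, coarsest first.

    Hypothetical scheme: keep every 2**j-th point, doubling the stride
    until the sub-sample has at most `threshold` points.
    """
    levels = []
    stride = 1
    while True:
        sub = list(sorted_sample[::stride])
        levels.append(sub)
        if len(sub) <= threshold:
            break
        stride *= 2  # next level keeps every other point of this one
    levels.reverse()  # coarsest (smallest) sub-sample first
    return levels
```

Because every level is a subset of the next, a scoring function evaluated on each level sees samples of very different sizes, which is why the statement asks for a score that is sample size invariant.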

“…Second, developing an efficient algorithm to optimize the score while adaptively constructing a non-parametric pdf. The second part will be accomplished by an algorithm involving a non-parametric maximum entropy method (NMEM) that was recently developed by JF and DJ [11] and implemented as the "PDFestimator." Similar to a traditional parametric maximum entropy method (MEM), NMEM employs Lagrange multipliers as coefficients to orthogonal functions within a generalized Fourier series.…”

Section: Introduction (mentioning)

confidence: 99%
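
For orientation, a maximum entropy density with Lagrange multipliers acting as coefficients of a function expansion has the general exponential form below. The symbols g_j (the orthogonal basis functions), D (the number of terms), and Z (the normalization constant) are notation assumed here for illustration and are not taken from the quoted text.

```latex
p(x) = \frac{1}{Z(\boldsymbol{\lambda})}
       \exp\!\Big( \sum_{j=1}^{D} \lambda_j \, g_j(x) \Big),
\qquad
Z(\boldsymbol{\lambda}) = \int \exp\!\Big( \sum_{j=1}^{D} \lambda_j \, g_j(x) \Big)\, dx
```

In a parametric MEM the number of terms is fixed in advance; the statement describes the non-parametric variant as constructing the pdf adaptively.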

“…It is not necessary to have a second data set to compare. As described previously [11], the empirical quantile can be plotted on the y-axis versus the theoretical average quantile for the true pdf plotted on the x-axis. From single order statistics (SOS) the expectation value of r_k is given by µ_k = k/(N + 1) for k = 1, 2, ..., N, which gives the mean quantile.…”

Section: Introduction (mentioning)

confidence: 99%
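
The mean quantile in this statement is straightforward to compute. The sketch below pairs each sorted, CDF-transformed data point with its expected position k/(N + 1), giving the x and y coordinates of the quantile plot described above. The function names and the `cdf` argument (a callable for the trial or true cumulative distribution) are assumptions made for illustration.

```python
def mean_quantiles(n):
    """Expected value of the k-th of n uniform order statistics: k / (n + 1)."""
    return [k / (n + 1) for k in range(1, n + 1)]


def empirical_quantiles(sample, cdf):
    """Sorted values r_k obtained by mapping each data point through the CDF."""
    return sorted(cdf(x) for x in sample)


def quantile_plot_points(sample, cdf):
    """Pairs (mean quantile, empirical quantile), one per data point."""
    r = empirical_quantiles(sample, cdf)
    return list(zip(mean_quantiles(len(r)), r))
```

If `cdf` matches the distribution that generated the data, the points should scatter around the diagonal, with no second data set needed.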

“…The QR-plot is scaled [11] in such a way as to make the scaled quantile residual (SQR) sample size invariant. From SOS, the standard deviation for the empirical quantile to deviate from the mean quantile is well-known to be…”

Section: Introduction (mentioning)

confidence: 99%
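
The scaling mentioned in this statement can be illustrated numerically. For uniform order statistics, the k-th value follows a Beta(k, N + 1 - k) distribution, whose standard deviation is sqrt(µ_k (1 - µ_k) / (N + 2)) with µ_k = k/(N + 1). The sketch below assumes the SQR is sqrt(N + 2) (r_k - µ_k), which cancels the N-dependence of that spread. The quoted text is truncated, so this exact form and the function names are assumptions.

```python
import math


def order_stat_std(k, n):
    """Standard deviation of the k-th of n uniform order statistics (Beta(k, n+1-k))."""
    mu = k / (n + 1)
    return math.sqrt(mu * (1 - mu) / (n + 2))


def scaled_quantile_residuals(r):
    """Assumed SQR: sqrt(N + 2) * (r_k - k / (N + 1)) for the sorted values r_k."""
    n = len(r)
    scale = math.sqrt(n + 2)
    return [scale * (rk - k / (n + 1))
            for k, rk in enumerate(sorted(r), start=1)]
```

Under this assumed scaling, each residual has standard deviation sqrt(µ_k (1 - µ_k)) regardless of N, so plots for different sample sizes share the same vertical scale.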


Universal Sample Size Invariant Measures for Uncertainty Quantification in Density Estimation

Farmer, Merino, Gray, et al. 2019, Entropy

Self Cite


Previously, we developed a high throughput non-parametric maximum entropy method (PLOS ONE, 13(5): e0196937, 2018) that employs a log-likelihood scoring function to characterize uncertainty in trial probability density estimates through a scaled quantile residual (SQR). The SQR for the true probability density has universal sample size invariant properties equivalent to sampled uniform random data (SURD). Alternative scoring functions are considered that include the Anderson-Darling test. Scoring function effectiveness is evaluated using receiver operator characteristics to quantify efficacy in discriminating SURD from decoy-SURD, and by comparing overall performance characteristics during density estimation across a diverse test set of known probability distributions.
