2018
DOI: 10.1371/journal.pone.0196937
|View full text |Cite
|
Sign up to set email alerts
|

High throughput nonparametric probability density estimation

Abstract: In high throughput applications, such as those found in bioinformatics and finance, it is important to determine accurate probability distribution functions despite only minimal information about data characteristics, and without using human subjectivity. Such an automated process for univariate data is implemented to achieve this goal by merging the maximum entropy method with single order statistics and maximum likelihood. The only required properties of the random variables are that they are continuous and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
23
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 19 publications
(23 citation statements)
references
References 55 publications
0
23
0
Order By: Relevance
“…As defined in Table 1, the proposed scoring functions include the relevant part of the Anderson-Darling (AD) measure [15], denoted as S AD , and the quasi log-likelihood formula [11], denoted as S LL . Note that…”
Section: Sample Size Invariant Scoring Functionsmentioning
confidence: 99%
See 4 more Smart Citations
“…As defined in Table 1, the proposed scoring functions include the relevant part of the Anderson-Darling (AD) measure [15], denoted as S AD , and the quasi log-likelihood formula [11], denoted as S LL . Note that…”
Section: Sample Size Invariant Scoring Functionsmentioning
confidence: 99%
“…A critical part of the algorithm in the PDFestimator [11] is that the input data sample is partitioned into hierarchical sub-samples by powers of 2 when N > 1025. Consequently, the employed scoring function should be sample size invariant for all partitions.…”
Section: Partition Size Invariancementioning
confidence: 99%
See 3 more Smart Citations