2001
DOI: 10.1007/978-0-387-21606-5
|View full text |Cite
|
Sign up to set email alerts
|

The Elements of Statistical Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

38
9,873
2
169

Year Published

2004
2004
2017
2017

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 14,574 publications
(10,082 citation statements)
references
References 116 publications
38
9,873
2
169
Order By: Relevance
“…Additional simulations using small number of markers and pseudo-ancestors indicate that structure usually produces a slightly smaller sampling variance compared to the MLE, but at the expense of being substantially biased. The comparison of the RMSE for the two estimates therefore depends on the bias-variance trade-off (Hastie et al 2001). …”
Section: Discussionmentioning
confidence: 99%
“…Additional simulations using small number of markers and pseudo-ancestors indicate that structure usually produces a slightly smaller sampling variance compared to the MLE, but at the expense of being substantially biased. The comparison of the RMSE for the two estimates therefore depends on the bias-variance trade-off (Hastie et al 2001). …”
Section: Discussionmentioning
confidence: 99%
“…Accordingly, the kernel width of 10 is excessively large for these data. In any case, the statistical properties of the Nadaraya-Watson estimator can be found in Hastie et al (2001).…”
Section: Discussionmentioning
confidence: 99%
“…Therefore, a way to quantify the representativeness of a region is to estimate the probability that the relative fit will fall within that region. One effective method is to use Gaussian kernels (e.g., Hastie, Tibshirani & Friedman, 2001), in which an unknown probability distribution is approximated by a mixture of a large number of normal distributions, one for each data set in the landscape (see Appendix B). The result is a distribution of fit representativeness, with each point in the landscape having an associated probability 4 .…”
Section: Defining Representativenessmentioning
confidence: 99%
“…With dendrograms resulting from the analysis of tumor specimens, bio-medical knowledge is fundamental to select a proper "cut", but also in this case an "objective" and data-driven assessment of the reliability of the clusters may be useful to support bio-medical decisions. Even when clusters and cluster boundaries are univocally defined by the clustering algorithms, such as in K-means [21], or in Self-Organizing-Maps [22], the number of clusters must be chosen a priori and the results depend on the initial conditions. Other methods based on a Bayesian paradigm that combines a priori knowledge with observational data, can automatically select the "optimal" number of clusters, but their accuracy is decremented when small samples are used, due to their asymptotic assumptions [23,24].…”
Section: Introductionmentioning
confidence: 99%