2020
DOI: 10.48550/arxiv.2005.14458
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Distributional Random Forests: Heterogeneity Adjustment and Multivariate Distributional Regression

Abstract: We propose an adaptation of the Random Forest algorithm to estimate the conditional distribution of a possibly multivariate response. We suggest a new splitting criterion based on the MMD two-sample test, which is suitable for detecting heterogeneity in multivariate distributions. The weights provided by the forest can be conveniently used as an input to other methods in order to locally solve various learning problems. The code is available as R-package drf.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 12 publications
0
8
0
Order By: Relevance
“…This implies that the continuum estimator proposed in Section 4.1, as a convex combination of Lipschitz functions, is also Lipschitz. Point-wise rates provided in [ Ćevid et al, 2020, Oprescu et al, 2019, Athey et al, 2019, Györfi et al, 2002 can then be translated into uniform rates, thanks to this Lipschitz property (see Lemma D.10).…”
Section: Regret Guaranteesmentioning
confidence: 99%
See 1 more Smart Citation
“…This implies that the continuum estimator proposed in Section 4.1, as a convex combination of Lipschitz functions, is also Lipschitz. Point-wise rates provided in [ Ćevid et al, 2020, Oprescu et al, 2019, Athey et al, 2019, Györfi et al, 2002 can then be translated into uniform rates, thanks to this Lipschitz property (see Lemma D.10).…”
Section: Regret Guaranteesmentioning
confidence: 99%
“…Here, ω i (s, a) roughly measures the proximity of the i th datapoint to the query point (s, a), so it is typically larger when (s i , a i ) is closer to (s, a). Common weight construction methods include k-nearest neighbors, kernel regressions, decision trees and various tree ensembles [Bertsimas and Kallus, 2020, Ćevid et al, 2020, Khosravi et al, 2019, Oprescu et al, 2019, Meinshausen and Ridgeway, 2006, Athey et al, 2019. With these weights, we can approximate f 0 (s, a; α) for any α with the following continuum estimator:…”
Section: Estimating a Continuum Of Nuisancesmentioning
confidence: 99%
“…Chagny [17] used an expansion of a "warped" conditional density onto a space spanned by orthonormal bases. Recently, Ćevid et al [18] studied conditional density estimation using an adapted Random Forest algorithm. In conclusion, although there has been some work on conditional density estimation, the minimax optimal rate is an open question.…”
Section: Relevant Literaturementioning
confidence: 99%
“…Chagny [18] used an expansion of a "warped" conditional density onto a space spanned by orthonormal bases. Recently, Ćevid et al [19] studied conditional density estimation using an adapted Random Forest algorithm. In conclusion, although there has been some work on conditional density estimation, the minimax optimal rate is an open question.…”
Section: Relevant Literaturementioning
confidence: 99%