2021
DOI: 10.48550/arxiv.2110.13413
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Convergent Boosted Smoothing for Modeling Graph Data with Tabular Node Features

Abstract: For supervised learning with tabular data, decision tree ensembles produced via boosting techniques generally dominate real-world applications involving iid training/test sets. However for graph data where the iid assumption is violated due to structured relations between samples, it remains unclear how to best incorporate this structure within existing boosting pipelines. To this end, we propose a generalized framework for iterating boosting with graph propagation steps that share node/sample information acro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 38 publications
0
4
0
Order By: Relevance
“…Leading GNN models fail to achieve competitive results for heterogeneous features with tabular or text node features [Ivanov and Prokhorenkova, 2021, Huang et al, 2020, Chen et al, 2021. To remedy this, Ivanov and Prokhorenkova [2021] jointly train Gradient Boosted Decision Trees (GBDT) and GNN in an end-to-end fashion, demonstrating a significant increase in performance on graph data with tabular node features.…”
Section: Graph Models With Multifaceted Node Featuresmentioning
confidence: 99%
See 3 more Smart Citations
“…Leading GNN models fail to achieve competitive results for heterogeneous features with tabular or text node features [Ivanov and Prokhorenkova, 2021, Huang et al, 2020, Chen et al, 2021. To remedy this, Ivanov and Prokhorenkova [2021] jointly train Gradient Boosted Decision Trees (GBDT) and GNN in an end-to-end fashion, demonstrating a significant increase in performance on graph data with tabular node features.…”
Section: Graph Models With Multifaceted Node Featuresmentioning
confidence: 99%
“…To remedy this, Ivanov and Prokhorenkova [2021] jointly train Gradient Boosted Decision Trees (GBDT) and GNN in an end-to-end fashion, demonstrating a significant increase in performance on graph data with tabular node features. Chen et al [2021] removes the need for a GNN altogether, proposing a generalized framework for iterating boosting with parameter-free graph propagation steps that share node/sample information across edges connecting related samples.…”
Section: Graph Models With Multifaceted Node Featuresmentioning
confidence: 99%
See 2 more Smart Citations