Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2022
DOI: 10.1145/3534678.3539312
|View full text |Cite
|
Sign up to set email alerts
|

Neural Bandit with Arm Group Graph

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 8 publications
0
4
0
Order By: Relevance
“…After kernel-based methods [13,30] were used to tackle the kernel-based reward mapping function under the non-linear settings, neural algorithms [5,41,43] have been proposed to utilize neural networks to estimate the reward function and confidence bound. Meanwhile, AGG-UCB [26] adopts GNN to model the arm group correlations. GCN-UCB [29] manages to apply the GNN model to embed arm contexts for the downstream linear regression, and GNN-PE [21] utilizes the UCB based on information gains to achieve exploration for classification tasks on graphs.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…After kernel-based methods [13,30] were used to tackle the kernel-based reward mapping function under the non-linear settings, neural algorithms [5,41,43] have been proposed to utilize neural networks to estimate the reward function and confidence bound. Meanwhile, AGG-UCB [26] adopts GNN to model the arm group correlations. GCN-UCB [29] manages to apply the GNN model to embed arm contexts for the downstream linear regression, and GNN-PE [21] utilizes the UCB based on information gains to achieve exploration for classification tasks on graphs.…”
Section: Related Workmentioning
confidence: 99%
“…𝑒 ) inspired by [7]. As it has proved that the confidence interval (uncertainty) of reward estimation can be expressed as a function of network gradients [26,43], we apply 𝑓 𝑖,𝑑 and two user nodes 𝑒, 𝑒 β€² , we let the edge weight be…”
Section: User Graph Estimation With User Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…In contrast, our goal is to derive an NN-based algorithm in the non-parametric setting that performs well both empirically and theoretically. Neural contextual bandits [57,55,14,12,13,41] provide the principled method to balance between the exploitation and exploration [10,11]. [50] transforms active learning into neural contextual bandit problem and obtains a performance guarantee, of which limitations are discussed above.…”
Section: Related Workmentioning
confidence: 99%