Jung-hun Kim scite author profile

In this paper we study a multi-class, multi-server queueing system with stochastic rewards of job-server assignments following a bilinear model in feature vectors representing jobs and servers. Our goal is regret minimization against an oracle policy that has complete information about the system parameters. We propose a scheduling algorithm that uses a linear bandit algorithm along with dynamic allocation of jobs to servers. For the baseline setting, in which mean job service times are identical for all jobs, we show that our algorithm has sub-linear regret, as well as a sub-linear bound on the mean queue length, in the time horizon. We further show that similar bounds hold under more general assumptions, allowing for non-identical mean job service times for different job classes and a time-varying set of server classes. We also show that better regret and mean queue length bounds can be guaranteed by an algorithm that has access to the traffic intensities of job classes. We present results of numerical experiments demonstrating how the regret and mean queue length of our algorithms depend on various system parameters, and we compare their performance against a previously proposed algorithm using synthetic, randomly generated data and a real-world cluster computing data trace.


A Study on the Methods to Improve Hangul Literacy of Primary School Entrants

Lee¹, Soonkyung², Kim³

2017

J. Curric. Eval.


Rotting infinitely many-armed bandits

Kim¹, Vojnović², Yun³

2022

Preprint


Adversarial Bandits Robust to Switching Targets

Kim¹, Yun²

2022

Preprint


We study the adversarial bandit problem with S switches of the best arm, for unknown S. To handle this problem, we adopt the master-base framework using the online mirror descent (OMD) method. We first provide a master-base algorithm with basic OMD, achieving $\tilde{O}(S^{1/2} K^{1/3} T^{2/3})$ regret. To improve the regret bound with respect to $T$, we propose using adaptive learning rates for OMD to control the variance of the loss estimators, and achieve $\tilde{O}(\min\{\mathbb{E}[\sqrt{S K T \rho_T(h^\dagger)}], S\sqrt{KT}\})$, where $\rho_T(h^\dagger)$ is a variance term for the loss estimators.



Jung-hun Kim

Contextual Linear Bandits under Noisy Features: Towards Bayesian Oracles

Scheduling Servers with Stochastic Bilinear Rewards
