Prasenjit Karmakar scite author profile

Prasenjit Karmakar

4Publications

88Citation Statements Received

118Citation Statements Given

How they've been cited

How they cite others

118

Affiliations

Technion – Israel Institute of Technology, Indian Institute of Science Bangalore

Publications

Order By: Most citations

Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning

Karmakar

Bhatnagar

2018

Mathematics of OR

View full text Add to dashboard Cite

We present for the first time an asymptotic convergence analysis of two time-scale stochastic approximation driven by 'controlled' Markov noise. In particular, both the faster and slower recursions have non-additive controlled Markov noise components in addition to martingale difference noise. We analyze the asymptotic behavior of our framework by relating it to limiting differential inclusions in both time-scales that are defined in terms of the ergodic occupation measures associated with the controlled Markov processes. Finally, we present a solution to the off-policy convergence problem for temporal difference learning with linear function approximation, using our results.

show abstract

Two Timescale Stochastic Approximation with Controlled Markov noise and Off-policy temporal difference learning

Karmakar¹,

Bhatnagar²

2015

Preprint

View full text Add to dashboard Cite

Stochastic Approximation With Iterate-Dependent Markov Noise Under Verifiable Conditions in Compact State Space With the Stability of Iterates Not Ensured

Karmakar

Bhatnagar

2021

IEEE Trans. Automat. Contr.

View full text Add to dashboard Cite

Customer-Server Population Dynamics in Heavy Traffic

2022

View full text Add to dashboard Cite

We study a many-server queueing model with server vacations, where the population size dynamics of servers and customers are coupled: a server may leave for vacation only when no customers await, and the capacity available to customers is directly affected by the number of servers on vacation. We focus on scaling regimes in which server dynamics and queue dynamics fluctuate at matching time scales so that their limiting dynamics are coupled. Specifically, we argue that interesting coupled dynamics occur in (a) the Halfin–Whitt regime, (b) the nondegenerate slowdown regime, and (c) the intermediate near Halfin–Whitt regime, whereas the dynamics asymptotically decouple in the other heavy-traffic regimes. We characterize the limiting dynamics, which are different for each scaling regime. We consider relevant respective performance measures for regimes (a) and (b)—namely, the probability of wait and the slowdown. Although closed-form formulas for these performance measures have been derived for models that do not accommodate server vacations, it is difficult to obtain closed-form formulas for these performance measures in the setting with server vacations. Instead, we propose formulas that approximate these performance measures and depend on the steady-state mean number of available servers and previously derived formulas for models without server vacations. We test the accuracy of these formulas numerically.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.