2023
DOI: 10.1007/s10994-023-06303-2

On the sample complexity of actor-critic method for reinforcement learning with function approximation

Abstract: Reinforcement learning, mathematically described by Markov Decision Problems, may be approached either through dynamic programming or policy search. Actor-critic algorithms combine the merits of both approaches by alternating between value-function estimation steps and policy-gradient updates. Because the updates exhibit correlated noise and biased gradients, only the asymptotic behavior of actor-critic is known, obtained by connecting the iterates to a dynamical system. This work puts forth a ne…
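
The alternation the abstract describes can be sketched as follows. This is a minimal, generic one-step actor-critic with linear function approximation; the environment interface (reset/step), feature map, and step sizes are illustrative assumptions, not the algorithm analyzed in the paper.

```python
import numpy as np

def actor_critic(env, featurize, n_actions, gamma=0.99,
                 alpha_critic=0.05, alpha_actor=0.01, episodes=500):
    """Sketch of one-step actor-critic with a linear critic and softmax actor.

    Assumes env.reset() -> state, env.step(a) -> (state, reward, done), and
    featurize(state) -> feature vector. Illustrative only.
    """
    d = featurize(env.reset()).shape[0]
    w = np.zeros(d)                   # critic: V(s) ~ w @ phi(s)
    theta = np.zeros((n_actions, d))  # actor: softmax policy parameters

    def policy(phi):
        logits = theta @ phi
        p = np.exp(logits - logits.max())
        return p / p.sum()

    for _ in range(episodes):
        s, done = env.reset(), False
        while not done:
            phi = featurize(s)
            p = policy(phi)
            a = np.random.choice(n_actions, p=p)
            s_next, r, done = env.step(a)
            phi_next = featurize(s_next)

            # critic step: TD(0) error and semi-gradient update of w
            td_error = r + (0.0 if done else gamma * w @ phi_next) - w @ phi
            w += alpha_critic * td_error * phi

            # actor step: policy-gradient update driven by the critic's TD error
            grad_log = -np.outer(p, phi)
            grad_log[a] += phi
            theta += alpha_actor * td_error * grad_log

            s = s_next
    return theta, w
```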

Cited by 33 publications (13 citation statements). References 55 publications.

“…For example, Kumar et al. (2019) provided a convergence rate analysis for a nested-loop Actor–Critic algorithm to a stationary point by quantifying the smallest number of actor updates k required to attain $\inf_{0\le m\le k}\Vert \nabla J(\theta^{(m)})\Vert^2 < \varepsilon$. We denote this smallest number as K.…”
Section: Discussion
confidence: 99%
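
Written out, the quantity denoted K in the excerpt is simply the first iteration index at which the best gradient norm so far drops below the tolerance (a restatement of the definition above, not a new result):

$$
K(\varepsilon) \;=\; \min\Bigl\{\, k \ge 0 \;:\; \inf_{0 \le m \le k} \bigl\Vert \nabla J(\theta^{(m)}) \bigr\Vert^{2} < \varepsilon \,\Bigr\}.
$$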
“…There are three main ways to execute the algorithm. In the nested‐loop setting (see, e.g., Kumar et al., 2019; Xu et al., 2020a), the actor updates the policy in the outer loop after the critic's repeated updates in the inner loop. The second way is the two time‐scale setting (see, e.g., Xu et al., 2020b), where the actor and the critic update their parameters simultaneously with different learning rates.…”
Section: The Basics Of Reinforcement Learning
confidence: 99%
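
The two execution schedules named in the excerpt differ only in when the actor is allowed to update. Below is a schematic contrast, assuming hypothetical `critic_update` and `actor_update` steps; the toy stand-ins exist only so the code runs and are not the actual gradient steps of any cited algorithm.

```python
# Toy stand-ins so the schedules below execute; in practice these would be a TD
# step for the critic and a policy-gradient step for the actor (assumption).
def critic_update(critic, actor, lr=1e-2):
    return critic + lr * (actor - critic)   # pull the value estimate toward a target

def actor_update(actor, critic, lr=1e-3):
    return actor + lr * critic               # move the policy using the critic's signal

def nested_loop(actor, critic, n_outer, n_inner):
    """Nested-loop: many critic (inner) updates per single actor (outer) update."""
    for _ in range(n_outer):
        for _ in range(n_inner):
            critic = critic_update(critic, actor)  # inner loop: policy evaluation
        actor = actor_update(actor, critic)        # outer loop: policy improvement
    return actor, critic

def two_time_scale(actor, critic, n_steps, lr_critic=1e-2, lr_actor=1e-3):
    """Two time-scale: both update every step, the critic on a faster learning rate."""
    for _ in range(n_steps):
        critic = critic_update(critic, actor, lr=lr_critic)  # fast time scale
        actor = actor_update(actor, critic, lr=lr_actor)     # slow time scale
    return actor, critic

# Example: nested_loop(0.0, 0.0, n_outer=10, n_inner=50)
```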
“…Actor-critic methods [188] combine policy gradient and value function estimation. The actor learns the policy, while the critic estimates the value function to evaluate the policy's performance.…”
Section: Q-learning [185]
confidence: 99%
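
In symbols, one standard instantiation of this division of labor is a TD-learning critic paired with a policy-gradient actor (a textbook form, not necessarily the exact variant used by the cited work): the critic's TD error both drives the value-parameter update and serves as the advantage signal for the policy update,

$$
\delta_t = r_{t+1} + \gamma \hat{V}_w(s_{t+1}) - \hat{V}_w(s_t), \qquad
w \leftarrow w + \alpha_w\, \delta_t\, \nabla_w \hat{V}_w(s_t), \qquad
\theta \leftarrow \theta + \alpha_\theta\, \delta_t\, \nabla_\theta \log \pi_\theta(a_t \mid s_t).
$$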
“…Compared with an on-policy algorithm, an off-policy algorithm can explore the environment freely while collecting interaction data. It can therefore make more efficient use of the data without degrading the performance of the final policy [21,22].…”
Section: On-policy Algorithm and Off-policy Algorithm
confidence: 99%
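
The data-efficiency point in the excerpt is usually realized with a replay buffer: off-policy learners can resample old transitions many times, whereas on-policy learners discard their batch after each policy update. A minimal sketch, with arbitrary illustrative buffer and batch sizes:

```python
import random
from collections import deque

# Transitions gathered under any behavior policy are stored and reused repeatedly.
replay_buffer = deque(maxlen=100_000)

def store(transition):
    """Append a (s, a, r, s_next, done) tuple collected from the environment."""
    replay_buffer.append(transition)

def sample_batch(batch_size=64):
    """Draw a uniform minibatch of stored transitions for an off-policy update."""
    return random.sample(list(replay_buffer), min(batch_size, len(replay_buffer)))
```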