2021
DOI: 10.1021/acs.iecr.0c05678

A Deep Reinforcement Learning Approach to Improve the Learning Performance in Process Control

Abstract: Advanced model-based control methods are widely used in industrial process control, but excellent performance requires regular maintenance of the underlying model. Reinforcement learning can update its policy online from data observed while interacting with the environment. Since a fast and stable learning process is required to improve the adaptability of the controller, we propose an improved deep deterministic actor-critic predictor in this paper, in which the immediate reward is separated from the action-value…
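
As a rough illustration of the abstract's core idea, the sketch below (plain PyTorch; the class, heads, and layer sizes are hypothetical, not the authors' implementation) splits the critic into an immediate-reward head and a discounted-future-value head, so that Q(s, a) = r̂(s, a) + γ·V̂(s, a):

```python
# Hypothetical sketch: a critic that separates the immediate reward from the
# discounted future value, so Q(s, a) = r_hat(s, a) + gamma * v_future(s, a).
import torch
import torch.nn as nn

class SeparatedCritic(nn.Module):
    def __init__(self, state_dim, action_dim, hidden=64, gamma=0.99):
        super().__init__()
        self.gamma = gamma
        # Head predicting the immediate reward r(s, a).
        self.reward_head = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1))
        # Head predicting the discounted value of everything after this step.
        self.future_head = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1))

    def forward(self, state, action):
        x = torch.cat([state, action], dim=-1)
        r_hat = self.reward_head(x)
        v_future = self.future_head(x)
        return r_hat + self.gamma * v_future, r_hat
```

One appeal of this separation is that the reward head can be fit directly against observed one-step rewards, while only the future-value head needs bootstrapped targets.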

Cited by 41 publications (22 citation statements) · References 29 publications
“…In the model-based setting, Kim et al. [29] incorporate deep neural networks (DNNs) as value function approximators into the globalized dual heuristic programming algorithm. Predictive models have also been augmented with popular DRL algorithms, such as DDPG or TD3, to improve the policy gradient estimation [30]. Other approaches to RL-based control postulate a fixed control structure such as PID [31,32,33]. Brujeni et al. [34] develop a model-free algorithm to dynamically select the PID gains from a pre-defined collection derived from Internal Model Control (IMC).…”
Section: Related Work
confidence: 99%
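
To make the last point concrete, here is a minimal sketch (illustrative parameter values; not code from Brujeni et al. [34]) of how a pre-defined collection of PI gains can be derived from standard IMC tuning rules for a first-order-plus-dead-time model, from which an RL agent could then select:

```python
# Hypothetical sketch: build a small collection of PI gains from the IMC
# tuning rule for a FOPDT process G(s) = K * exp(-theta*s) / (tau*s + 1).
def imc_pi_gains(K, tau, theta, lam):
    """IMC-based PI tuning; lam is the closed-loop time constant (tuning knob)."""
    Kc = tau / (K * (lam + theta))   # proportional gain
    tau_I = tau                      # integral time
    return Kc, tau_I

# A pre-defined gain collection: vary the IMC filter constant lambda.
K, tau, theta = 2.0, 55.0, 5.0       # illustrative model parameters
gain_set = [imc_pi_gains(K, tau, theta, lam) for lam in (10, 25, 50, 100)]
print(gain_set)
```

Smaller lambda values give more aggressive gains, so the collection spans a spectrum from fast to conservative control.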
“…As to the use of application software, approximately 60% of the studies utilized MATLAB for controller development, integration, or real-time implementation [60,63,69]. Other researchers mostly used TensorFlow [64] and Python [83]. Some studies have integrated functionalities of more than one software application, for example, (i) MATLAB's process simulation module with Python's ANN generator APIs (PyTorch and Keras) [70] and (ii) a hybrid training module of TensorFlow and Python's Lambda deep learning workstation.…”
Section: Process Control Applications
confidence: 99%
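
As a small illustration of the Python-side tooling this survey mentions, the following sketch builds a minimal Keras ANN of the kind that gets paired with a process simulator (layer sizes, inputs, and training data are placeholders, not taken from any cited study):

```python
# Illustrative only: a tiny Keras ANN mapping a controller state to an action.
import numpy as np
from tensorflow import keras

model = keras.Sequential([
    keras.layers.Input(shape=(3,)),           # e.g. [error, d_error, int_error]
    keras.layers.Dense(32, activation="relu"),
    keras.layers.Dense(1)                     # e.g. controller output
])
model.compile(optimizer="adam", loss="mse")

# Fit on (state, action) pairs exported from a process simulation.
X = np.random.randn(256, 3)                   # placeholder training data
y = np.random.randn(256, 1)
model.fit(X, y, epochs=5, verbose=0)
```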
“…The present work differs significantly from the approaches mentioned so far. Other approaches to more sample-efficient RL in process control utilize apprenticeship learning, transfer learning, or model-based strategies augmented with deep RL algorithms [29,21,30]. Our method differs in two significant ways: the training and deployment process is simplified with our meta-RL agent through its synthesized training over a large distribution of systems.…”
Section: Related Work
confidence: 99%
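
A schematic sketch of the "training over a large distribution of systems" idea (the parameter ranges and first-order plant are assumptions for illustration, not the cited method): each episode draws a new random plant, so the agent must adapt rather than memorize one system.

```python
# Hypothetical meta-training skeleton: sample a random first-order process
# each episode so the agent learns to adapt across a distribution of plants.
import random

def sample_task(rng=random):
    K   = rng.uniform(0.5, 2.0)    # process gain (assumed range)
    tau = rng.uniform(0.25, 4.0)   # dimensionless time constant (assumed range)
    return K, tau

def simulate_step(y, u, K, tau, dt=0.1):
    # Euler step of the first-order system tau * dy/dt = -y + K * u.
    return y + dt * (-y + K * u) / tau

for episode in range(1000):
    K, tau = sample_task()         # a fresh plant every episode
    y = 0.0
    for _ in range(50):            # placeholder rollout
        u = 1.0                    # the agent's action would go here
        y = simulate_step(y, u, K, tau)
    # ... compute rewards and update the meta-RL agent ...
```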
“…By picking a slow sampling time, the tank's dynamics appear faster from the perspective of the meta-RL agent. To geometrically center the time constant in Equation (28) within the meta-RL agent's task distribution, we set the sampling time to 30/0.5 = 60 seconds. The true time constant of 55 seconds then appears as a time constant of 55/60 ≈ 0.92 to the meta-RL agent.…”
Section: Adapting the Meta-RL Model to the Two-Tank System
confidence: 99%
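
A quick worked check of the time-scaling in the quoted passage, assuming the 60-second sampling interval it states:

```python
# Time-scale normalization: with a 60 s sampling interval, the tank's true
# 55 s time constant spans 55/60 of one sampling period, i.e. roughly 0.92
# as seen by the meta-RL agent.
dt = 60.0                 # sampling interval in seconds (as quoted)
tau_true = 55.0           # true time constant in seconds (as quoted)
print(round(tau_true / dt, 2))  # -> 0.92
```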