2020
DOI: 10.3390/s20247297
|View full text |Cite
|
Sign up to set email alerts
|

Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System

Abstract: Reinforcement learning (RL) is a promising direction in automated parking systems (APSs), as integrating planning and tracking control using RL can potentially maximize the overall performance. However, commonly used model-free RL requires many interactions to achieve acceptable performance, and model-based RL in APS cannot continuously learn. In this paper, a data-efficient RL method is constructed to learn from data by use of a model-based method. The proposed method uses a truncated Monte Carlo tree search … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
14
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(17 citation statements)
references
References 31 publications
0
14
0
Order By: Relevance
“…The constraints for the state vector of the MPC problem are defined as (25). The input constraints are defined as (26) and (27), which mean a maximum value and jerk limit to each actuator needed to consider the actuator limit. The inequality constraints on steering of 4WIS are defined as (26).…”
Section: Cost Function and Constraintsmentioning
confidence: 99%
See 3 more Smart Citations
“…The constraints for the state vector of the MPC problem are defined as (25). The input constraints are defined as (26) and (27), which mean a maximum value and jerk limit to each actuator needed to consider the actuator limit. The inequality constraints on steering of 4WIS are defined as (26).…”
Section: Cost Function and Constraintsmentioning
confidence: 99%
“…In (26), δ max and δ' max are the maximum steering angle and its angular rate, respectively. The inequality constraints for drive torque of 4WID are defined as (27). In (27), F x,max, and F' x,max are the maximum longitudinal tire force and its rate, respectively.…”
Section: Cost Function and Constraintsmentioning
confidence: 99%
See 2 more Smart Citations
“…Similarly, authors of [23] combine Model Predictive Control (MPC) with RBFNN to robustly handle the nonlinear characteristics of the steering system. A reinforcement-learning-based integrated planning and control method is proposed for automated parking applications in [24], which can simultaneously coordinate the longitudinal and lateral motions to park in a smaller parking space in one maneuver. Reinforcement learning is also used for high velocity lane change maneuvering in [25], where a Deep Deterministic Policy Gradient (DDPG) agent is utilized in an end-to-end method using lidar data.…”
Section: Introduction 1literature Outlookmentioning
confidence: 99%