An Ensemble Fuzzy Approach for Inverse Reinforcement Learning

Pan, Wei; Qu, R.T.; Hwang, Kao-Shing; Lin, Hung-Shyuan

doi:10.1007/s40815-018-0535-y

Cited by 7 publications

(4 citation statements)

References 17 publications


“…Though the final effect converges to an ideal level, it cannot be proved that it is the optimal reward setting. Accordingly, we will consider the method of inverse reinforcement learning [33,34] to optimize the reward. (2) In this paper, the reinforcement learning-based parking only has the function of reversing (e.g., “step two” in Figure 14), and it cannot automatically adjust the gear forward and backward.…”

Section: Discussion (mentioning)

confidence: 99%

Reinforcement Learning-Based End-to-End Parking for Automatic Parking System

Zhang, Xiong, et al. 2019

Sensors


According to the existing mainstream automatic parking system (APS), a parking path is first planned based on the parking slot detected by the sensors. Subsequently, the path tracking module guides the vehicle to track the planned parking path. However, since the vehicle is non-linear dynamic, path tracking error inevitably occurs, leading to inclination and deviation of the parking. Accordingly, in this paper, a reinforcement learning-based end-to-end parking algorithm is proposed to achieve automatic parking. The vehicle can continuously learn and accumulate experience from numerous parking attempts and then learn the command of the optimal steering wheel angle at different parking slots. Based on this end-to-end parking, errors caused by path tracking can be avoided. Moreover, to ensure that the parking slot can be obtained continuously in the process of learning, a parking slot tracking algorithm is proposed based on the combination of vision and vehicle chassis information. Furthermore, given that the learning network output is hard to converge, and it is easy to fall into local optimum during the parking process, several reinforcement learning training methods in terms of parking conditions are developed. Lastly, by the real vehicle test, it is proved that using the proposed method can achieve a better parking attitude than using the path planning and path tracking-based method.


“…A fuzzy set can affect the reward value obtained by agents by measuring dissimilarity. Pan et al proposed a dissimilarity evaluation metric for deciding the weight value of each agent's reward in ERL [43]. In this way, ERL can achieve a good training effect with fewer iterations.…”

Section: Lin Et Al Proposed An Adaptive Adjustment Methods For Reward... (mentioning)

confidence: 99%

“…To comprehensively test the effectiveness of algorithms, different models, training algorithms, and integration methods should be separately evaluated [54].

Reference | Datasets | Compared methods
[99] | Atari games | DQN
Chen et al [8] | Atari games | A3C+
Partalas et al [100] | UCI machine learning repository | classifier combination methods voting (V) and SMT and the forward selection (FS), selective fusion (SF)
Pearce et al [101] | Cart Pole control problem | Q-learning with different layer NNs
Dong et al [53] | traffic speed dataset | GRU, LSTM, MLP, RBF, LSTM-GRU-GA
Pan et al [43] | Maze, Mountain Car, Robotic Soccer Game Simulation | counterpart
Goyal et al [46] | CATS (Competition on Artificial Time series) dataset | LSTM, ANN, Linear regression, Random Forest, Online NN
Macheng Shen and Jonathan P How [102] | two-player asymmetric game | single model, RNN
Qingfeng Lan et al [37] | Mountain Car | Q-learning, Double Q-learning, Averaged Q-learning
Liu et al [54] | three different groups of measured wind speed data from Xinjiang wind farms | Network: LSTM method, the DBN method, the ESN method; Training algorithm: SARSA
Lin et al [41] | Maze, soccer robot game | orthogonal projection inverse reinforcement learning method (OP-IRL)
Junta Wu and Huiyun Li [73] | 2D Robot Arm Open Racing Car Simulator (TORCS) | DDPG
Yang et al [14] | Dow Jones 30 constituent stocks (at 01/01/2016) | PPO, A2C, DDPG
Liu et al [33] | UCI online data repository | classifiers combination approaches majority voting (MV), weighted voting (WV), ensemble selection methods forward selection (FS)
Ghosh et al [38] | open source air traffic simulator | PPO
Jalali et al [81] | GHI data sets | adaptive hybrid model (AHM), hybrid feature selection method (HFS), Outlier-robust hybrid model (ORHM), novel hybrid deep neural network model (NHDNNM), OHS-LSTM
Liu et al [56] | data collected from a congested intersection in Changsha | RNN, ENN, ESN, DBN, RBF, GRNN, MLP
Jalali et al [72] | two well-known open-source image datasets named as Mendely and Kaggle | original version of GSK and eight powerful evolutionary algorithms including grasshopper optimization algorithm (GOA), Slime mold algorithm (SMA), genetic algorithm, gray wolf optimizer (GWO), particle swarm optimization (PSO), differential evolution (DE), biogeography-based optimization (BBO)
Hassam Ullah Sheikh et al [15] | Mujoco environments, Atari games | TD3, SAC and REDQ
Shang et al [30] | actual traffic volume data of nine stations of Changsha freeway | Chebnet, CNN, LSTM, DBN, RNN, ESN, multi-layer perceptron (MLP)
Tan et al …”

Section: Datasets and Compared Methods (mentioning)

confidence: 99%


Ensemble Reinforcement Learning: A Survey

Ye, Suganthan, Pedrycz, et al. 2023

Preprint


Reinforcement Learning (RL) has emerged as a highly effective technique for addressing various scientific and applied problems. Despite its success, certain complex tasks remain challenging to be addressed solely with a single model and algorithm. In response, ensemble reinforcement learning (ERL), a promising approach that combines the benefits of both RL and ensemble learning (EL), has gained widespread popularity. ERL leverages multiple models or training algorithms to comprehensively explore the problem space and possesses strong generalization capabilities. In this study, we present a comprehensive survey on ERL to provide readers with an overview of recent advances and challenges in the field. First, we introduce the background and motivation for ERL. Second, we analyze in detail the strategies that have been successfully applied in ERL, including model averaging, model selection, and model combination. Subsequently, we summarize the datasets and analyze algorithms used in relevant studies. Finally, we outline several open questions and discuss future research directions of ERL. By providing a guide for future scientific research and engineering applications, this survey contributes to the advancement of ERL.
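Two of the ensemble strategies named in this abstract can be illustrated with a minimal sketch. It is not taken from the survey: the Q-tables, the function names (`average_q_values`, `majority_vote_action`) and the toy problem size are all hypothetical, and the survey itself may define these strategies differently. The sketch only shows that averaging the members' value estimates and letting the members vote on actions can select different actions from the same ensemble.

```python
from collections import Counter


def greedy_action(q_row):
    """Index of the highest Q-value in one state's row (first index on ties)."""
    return max(range(len(q_row)), key=lambda a: q_row[a])


def average_q_values(q_tables):
    """Model averaging: element-wise mean of the members' Q-tables."""
    n = len(q_tables)
    n_states = len(q_tables[0])
    n_actions = len(q_tables[0][0])
    return [
        [sum(q[s][a] for q in q_tables) / n for a in range(n_actions)]
        for s in range(n_states)
    ]


def majority_vote_action(q_tables, state):
    """Combination by voting: each member votes for its own greedy action."""
    votes = Counter(greedy_action(q[state]) for q in q_tables)
    return votes.most_common(1)[0][0]


# Hypothetical ensemble: three Q-tables for 2 states and 3 actions.
q_tables = [
    [[1.0, 0.0, 0.0], [0.0, 2.0, 1.0]],
    [[0.0, 3.0, 0.0], [0.0, 1.0, 2.0]],
    [[1.0, 0.5, 0.0], [0.0, 0.0, 3.0]],
]

avg_q = average_q_values(q_tables)
action_by_average = greedy_action(avg_q[0])        # one confident member dominates
action_by_vote = majority_vote_action(q_tables, 0)  # two of three members agree
```

In state 0 the two rules disagree: averaging follows the single member with a large value estimate, while voting follows the majority. This is one reason the choice of integration method may need to be evaluated separately from the choice of base models.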
