A single-time velocity estimator-based reinforcement learning (RL) algorithm, integrated with a chaotic metaheuristic optimization technique is proposed in this article for the optimal path-planning of the nonholonomic robots considering a moving/stationary obstacle avoidance strategy. The additional contribution of the present study is by employing the Terramechanics principles to incorporate the effects of wheel sinkage into the deformable terrain on the dynamics of the robot aiming to find the optimal compensating force/torque magnitude to sustain a robust and smooth motion. The designed systematic control-oriented system incorporates a cost function of weighted components associated with the targettracking and the obstacle avoidance. The designed velocity estimator contributes to the finite-state Markov decision process (MDP) in order to train the transition probabilities of the problem objectives. Based on the obtained results, the optimal solution for the Q-learning in terms of the adjusting factor for the minimized tracking error and obstacle collision risk propagation profiles is found at 0.22. The results further confirm the promising capacity of the proposed optimization-based RL algorithm for the collision avoidance control of the nonholonomic robots on deformable terrains. INDEX TERMS Mechatronics, terramechanics, path-planning, artificial intelligence.
Wheel dynamics play a substantial role in traversing and controlling the vehicle, braking, ride comfort, steering, and maneuvering. The transient wheel dynamics are difficult to be ascertained in tire-obstacle contact condition. To this end, a single-wheel testing rig was utilized in a soil bin facility for provision of a controlled experimental medium. Differently manufactured obstacles (triangular and Gaussian shaped geometries) were employed at different obstacle heights, wheel loads, tire slippages and forward speeds to measure the forces induced at vertical and horizontal directions at tire-obstacle contact interface. A new Takagi-Sugeno type neuro-fuzzy network system with a modified Differential Evolution (DE) method was used to model wheel dynamics caused by road irregularities. DE is a robust optimization technique for complex and stochastic algorithms with ever expanding applications in real-world problems. It was revealed that the new proposed model can be served as a functional alternative to classical modeling tools for the prediction of nonlinear wheel dynamics.
In this paper, a novel algorithm is proposed for the motion planning and path following automated cars with the incorporation of a collision avoidance strategy. This approach is aligned with an optimal reinforcement learning (RL) coupled with a new risk assessment approach. For this purpose, a probabilistic function-based collision avoidance strategy is developed, and the proposed RL approach learns the probability distributions of the adjacent and leading vehicles. Subsequently, the nonlinear model predictive control (NMPC) algorithm approximates the optimal steering input and the required yaw moment to follow the safest and shortest path through the optimal RL-based probabilistic risk function framework. Additionally, it is attempted to maintain the travel speed for the ego vehicle stable such that the ride comfort is also offered for the vehicle occupants. For this purpose, the steering system dynamics are also incorporated to provide a thorough understanding of the vehicle dynamics characteristic. Different driving scenarios are employed in the present paper to evaluate the proposed algorithm’s effectiveness.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.