Neural Network-Based Adaptive Motion Control for a Mobile Robot with Unknown Longitudinal Slipping

Wang, Gang; Liu, Xiaoping; Zhao, Yunlong; Han, Song

doi:10.1186/s10033-019-0373-3

Cited by 17 publications

(5 citation statements)

References 32 publications


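“…Dijkstra's algorithm uses an exhaustive traversal search [2] and can solve the shortest-path problem from one vertex to all other vertices, but it evaluates many nodes and is computationally expensive, so path finding is slow and the algorithm is unsuitable for complex environments [3]. The A* algorithm, proposed by HART et al. in 1968, adds a heuristic function to Dijkstra's algorithm [4]; it ranks candidate nodes for expansion by their cost values, selects the best one, and repeats until the goal is reached. Although A* reduces the number of expanded nodes and improves computational efficiency, the resulting routes have many turning points and small safety margins [5]. DUAN Shuyong et al. introduced a safety-distance matrix and an optimized heuristic function into A*, reducing the number of turning points and improving path safety [6]. The JPS algorithm introduces a directional search mode [7], which further optimizes how A* finds successor nodes, reduces the number of turning points and expanded nodes, and improves both path smoothness and search speed [8]. The artificial potential field method, proposed by KHATIB in 1986, builds attractive and repulsive fields on the map from the positions of the goal and the obstacles, and plans a safe path along the descent direction of the potential function [9]. In complex environments, however, the constructed potential field tends to contain multiple zero-potential points, so the path may fail to reach the goal [10]. The rapidly-exploring random tree algorithm is a sampling-based path planner that repeatedly grows new nodes from those already in the search tree until it finds the goal. It has strong search capability and is fast, but it also suffers from low search accuracy and poor path smoothness [11][12]. DORIGO proposed the ant colony algorithm, inspired by the cooperative foraging of ants; it is robust and easy to combine with other algorithms, but converges slowly [13][14]. Neural network algorithms obtain an optimal solution by training on a large number of data samples, but they require substantial training time and place high demands on the network model [15][16]. The above algorithms apply when global environment information is known [17]; they do not apply when information about static obstacles is unknown. The Q-Learning algorithm proposed by WATKINS does not depend on an environment model and needs no prior knowledge of the environment; using its own reward and penalty mechanism, it can find a safe path from start to goal by trial and error [18], and it is also robust. Many improvements to Q-Learning have therefore appeared. For example, LOW et al. improved the initialization of Q-Learning based on the flower pollination principle, increasing convergence speed [19]; MARTHI decomposed the reward function, improving the Agent's learning efficiency [20]; and BIANCHI et al. proposed the Heuristically Accelerated Q-Learning algorithm, which guides the Agent's action selection with a heuristic function that can be updated online [21]. Although these improved Q-Learning algorithms raise the Agent's learning efficiency, the paths they produce when applied directly to path planning have many turning points and are not smooth [22][23]. In summary, building on the Q-Learning algorithm and improved Q-Learning algorithms [24][25][26], this paper proposes a Smoothed-Shortcut…”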

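Section: Path planning refers to finding, according to some decision criterion, a path in a known or unknown global working environment that leads from the initial position to the target… (unclassified)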

Smoothed-shortcut Q-Learning Algorithm for Optimal Robot Agent Path Planning

2022

Journal of Mechanical Engineering


Quality path planning for a mobile robot in operation is the key to completing its task safely, efficiently and smoothly. Such planning often must be done in an environment that is initially unknown to the Agent, which calls for effective reinforcement learning. A Smoothed-shortcut Q-learning (SSQL) algorithm is presented that enables the Agent to learn and then work out a smoothed shortcut path to a goal that is initially unknown to it in a given environment. SSQL is proposed to solve the practical problem of a mobile robot reaching its goal in an unfamiliar environment along a smooth, continuous curve of shortest distance. The SSQL algorithm consists of three major ingredients. First, a virtual rectangular boundary of the environment is constructed from pre-explored information, and the Q values of the guidance points for this virtual rectangular environment are increased to improve the Agent's learning efficiency. Second, the path found by the Agent so far is optimized by finding shortcuts along it, which eliminates redundant segments, reduces zig-zags, and minimizes the total distance between the start and target points. Third, at the turning positions on the path, a Bezier curve is used to smooth the path further, improving the dynamics of the robot agent's motion. The final path generated by the SSQL algorithm is optimal in terms of fast convergence, smoothness and shortest distance. The SSQL algorithm is tested against the standard Q-Learning algorithm in different environments with various obstacle densities and learning rates. The results show that the SSQL algorithm achieves fast convergence and short, smooth paths with few turning points.

* Supported by the National Natural Science Foundation of China (52175222), the Hebei Province Key Research and Development Program (19227208D), and the Tianjin Science and Technology Program (19ZXZNGX00100). Manuscript received 20210722; revised 20220124.
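The second and third ingredients of the abstract (shortcut finding and Bezier smoothing at turning points) can be illustrated with a small sketch. This is not the authors' implementation, which the abstract does not specify; it assumes a grid map whose cells are centred on integer coordinates, a sampled line-of-sight test, and a quadratic Bezier curve at each corner. All function names and the `radius`, `n`, and `samples_per_cell` parameters are hypothetical, and the smoothed curve is not re-checked against obstacles here.

```python
import math
from typing import List, Set, Tuple

Cell = Tuple[int, int]
Point = Tuple[float, float]


def line_is_free(a: Cell, b: Cell, obstacles: Set[Cell],
                 samples_per_cell: int = 10) -> bool:
    """Approximate line-of-sight test: sample the segment a->b densely and
    fail if any sample lies in an obstacle cell (assumed cell model)."""
    steps = max(abs(b[0] - a[0]), abs(b[1] - a[1])) * samples_per_cell
    for i in range(steps + 1):
        t = i / steps if steps else 0.0
        x = a[0] + t * (b[0] - a[0])
        y = a[1] + t * (b[1] - a[1])
        if (math.floor(x + 0.5), math.floor(y + 0.5)) in obstacles:
            return False
    return True


def shortcut(path: List[Cell], obstacles: Set[Cell]) -> List[Cell]:
    """Greedy shortcut: from each kept waypoint, jump to the farthest later
    waypoint that is visible. Consecutive waypoints are assumed connected."""
    out = [path[0]]
    i = 0
    while i < len(path) - 1:
        j = len(path) - 1
        while j > i + 1 and not line_is_free(path[i], path[j], obstacles):
            j -= 1
        out.append(path[j])
        i = j
    return out


def toward(p: Point, q: Point, d: float) -> Point:
    """Point at distance min(d, |pq| / 2) from p in the direction of q."""
    dx, dy = q[0] - p[0], q[1] - p[1]
    dist = math.hypot(dx, dy)
    if dist == 0.0:
        return (float(p[0]), float(p[1]))
    d = min(d, dist / 2.0)
    return (p[0] + dx * d / dist, p[1] + dy * d / dist)


def bezier2(a: Point, c: Point, b: Point, t: float) -> Point:
    """Quadratic Bezier curve with end points a, b and control point c."""
    u = 1.0 - t
    return (u * u * a[0] + 2 * u * t * c[0] + t * t * b[0],
            u * u * a[1] + 2 * u * t * c[1] + t * t * b[1])


def smooth_corners(path: List[Cell], radius: float = 0.5,
                   n: int = 8) -> List[Point]:
    """Replace each interior waypoint with n + 1 samples of a quadratic
    Bezier arc that uses the waypoint itself as the control point."""
    if len(path) < 3:
        return [(float(x), float(y)) for x, y in path]
    out: List[Point] = [(float(path[0][0]), float(path[0][1]))]
    for prev, cur, nxt in zip(path, path[1:], path[2:]):
        a = toward(cur, prev, radius)
        b = toward(cur, nxt, radius)
        out.extend(bezier2(a, cur, b, k / n) for k in range(n + 1))
    out.append((float(path[-1][0]), float(path[-1][1])))
    return out


# Usage: an L-shaped grid path around one obstacle cell.
raw_path = [(0, 0), (1, 0), (2, 0), (2, 1), (2, 2)]
short_path = shortcut(raw_path, obstacles={(1, 1)})
smoothed = smooth_corners(short_path, radius=0.5, n=4)
```

Capping the corner offset at half the segment length keeps the arcs of neighbouring corners from overlapping; a real planner would also need to verify that each arc stays clear of obstacles.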

“…According to their novel features, various practical industries like medical professions use these platforms [1][2][3]. For this reason, mobile robots, especially wheeled ones, have received vast attractions and treated by many robotic researchers [4][5][6]. In addition, in order to enhance the output and their effectiveness, researchers have enthusiastically endeavored to generate optimal paths for mobile robotic platforms.…”

Section: Introduction (mentioning)

confidence: 99%

Optimal trajectory tracking of tractor-trailer wheeled platforms taken into account wheel dynamics

Nazemizadeh, Kolahi, Gandomkar

2024

Eng. Res. Express


Tractor-trailer wheeled platforms are articulated robotic systems comprising a leading active mobile platform (the tractor) that pulls one or more passive platforms (the trailers). Owing to features such as maximum load-carrying capacity and minimum energy consumption, these mobile platforms are widely employed for conveying and transportation. This article deals with path tracking optimization of the tractor-trailer moving platform using the indirect method of optimal control. First, the nonlinear equations of motion of the modular platform are obtained, that is, the relations describing the kinematic constraints and the inertia characteristics of the wheels. Then, the resulting dynamic equations are taken as constraints of the optimization problem, and an appropriate objective criterion is defined for optimal trajectory tracking. Next, a solution to the problem and simulations are presented. The results demonstrate the competency and applicability of the proposed procedure for optimal tracking of tractor-trailer wheeled mobile platforms with their wheel dynamics taken into account.


“…Usually, in similar studies, differential equations are used [1], [2], [4], [5] to represent the mathematical model. Instead, the D'Alembert's principle is used here, which considers an equilibrium system of forces, including inertial forces [6]; the accuracy of the calculations does not deteriorate.…”

Section: Introduction (mentioning)

confidence: 99%

Longitudinal Stability of Wheeled Mobile Robots - Degree of Stability

Lilov, Markova, Popov et al.

2024

ETR


This paper studies the longitudinal stability of a wheeled mobile robot using a geometric similarity coefficient (defined in the paper). The calculations are based on D'Alembert's principle. The results give the limit driving/braking forces and limit accelerations/decelerations for a given geometric similarity coefficient, before the robot loses stability.

