This paper presents an observer-integrated Reinforcement Learning (RL) approach, called Disturbance OBserver Network (DOB-Net), for robots operating in environments where disturbances are unknown, time-varying, and may frequently exceed the robot's control capabilities. The DOB-Net integrates a disturbance dynamics observer network and a controller network. Originating from conventional DOB mechanisms, the observer is built and enhanced via Recurrent Neural Networks (RNNs), encoding the estimation of past values and the prediction of future values of the unknown disturbances in the RNN hidden state. This encoding allows the controller to generate optimal control signals that actively reject disturbances, under the constraints of the robot's control capabilities. The observer and the controller are jointly learned within policy optimization using advantage actor-critic. Numerical simulations on position regulation tasks demonstrate that the proposed DOB-Net significantly outperforms a conventional feedback controller and classical RL algorithms.
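
The observer-controller structure described above can be sketched as follows. This is a minimal illustration only, assuming a GRU observer whose hidden state carries the disturbance encoding and an MLP controller head; the layer sizes, the GRU choice, and the class name `DOBNet` are assumptions, and the joint advantage actor-critic training loop is omitted.

```python
import torch
import torch.nn as nn

class DOBNet(nn.Module):
    """Sketch of an RNN observer + controller network (assumed structure)."""

    def __init__(self, obs_dim, act_dim, hidden_dim=64):
        super().__init__()
        # Observer: RNN whose hidden state summarizes the disturbance dynamics
        self.observer = nn.GRU(input_size=obs_dim + act_dim,
                               hidden_size=hidden_dim, batch_first=True)
        # Controller: maps observation + disturbance encoding to a bounded action
        self.controller = nn.Sequential(
            nn.Linear(obs_dim + hidden_dim, hidden_dim),
            nn.Tanh(),
            nn.Linear(hidden_dim, act_dim),
            nn.Tanh(),  # keeps actions within normalized control limits
        )

    def forward(self, obs, prev_act, hidden=None):
        # obs: (batch, obs_dim), prev_act: (batch, act_dim)
        rnn_in = torch.cat([obs, prev_act], dim=-1).unsqueeze(1)
        _, hidden = self.observer(rnn_in, hidden)   # update disturbance encoding
        encoding = hidden[-1]                       # (batch, hidden_dim)
        action = self.controller(torch.cat([obs, encoding], dim=-1))
        return action, hidden
```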