2019
DOI: 10.48550/arxiv.1907.04514
Preprint

DOB-Net: Actively Rejecting Unknown Excessive Time-Varying Disturbances

Abstract: This paper presents an observer-integrated Reinforcement Learning (RL) approach, called Disturbance OBserver Network (DOB-Net), for robots operating in environments where disturbances are unknown, time-varying, and may frequently exceed robot control capabilities. The DOB-Net integrates a disturbance dynamics observer network and a controller network. Originating from classical DOB mechanisms, the observer is built and enhanced via Recurrent Neural Networks (RNNs), encoding estimation of past values and pred…

Cited by 4 publications (11 citation statements)
References 26 publications
“…However, this is not the case when the disturbances are considered as the unobservable parts of the state space, since it is difficult to formulate a transition function that predicts the next disturbances from the current state (including disturbances) and action alone. Both the history window approach [31] and the recurrent policy [32] attempt to resolve this issue by characterizing the disturbed system transition as a multi-step MDP and assuming the unobservable disturbance waveforms are encoded in the robot's motion history. The difference lies in how the history data are used: the history window approach directly takes the most recent state-action pairs as additional input to the policy, while the recurrent policy employs an RNN to explore past experience in order to learn an optimal embedding of the history data.…”
Section: A. Reinforcement Learning in Partially Observable Markov Decision Processes
confidence: 99%
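The contrast between the two approaches can be made concrete with a minimal sketch (PyTorch is assumed; the state/action dimensions, window length, and layer sizes are illustrative assumptions, not values from [31] or [32]):

```python
import torch
import torch.nn as nn

STATE_DIM, ACTION_DIM, WINDOW = 8, 2, 10  # assumed, for illustration

class HistoryWindowPolicy(nn.Module):
    """History window approach [31]: the most recent state-action
    pairs are fed to the policy directly as a flat additional input."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(WINDOW * (STATE_DIM + ACTION_DIM), 64),
            nn.ReLU(),
            nn.Linear(64, ACTION_DIM),
        )

    def forward(self, history):  # history: (batch, WINDOW, STATE_DIM + ACTION_DIM)
        return self.net(history.flatten(1))

class RecurrentPolicy(nn.Module):
    """Recurrent policy [32]: an RNN learns its own embedding of an
    arbitrary-length history before the action head."""
    def __init__(self):
        super().__init__()
        self.rnn = nn.GRU(STATE_DIM + ACTION_DIM, 64, batch_first=True)
        self.head = nn.Linear(64, ACTION_DIM)

    def forward(self, history):  # history: (batch, T, STATE_DIM + ACTION_DIM)
        _, h = self.rnn(history)  # h[-1]: learned embedding of the whole history
        return self.head(h[-1])
```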
“…Previous work [32] has demonstrated that an RNN can directly learn to control a dynamical system with unobservable disturbances in an end-to-end manner, where the past motion history is mapped to the control action. In contrast, inspired by [33], this work applies a modular learning procedure that explicitly decouples the process into disturbance identification and motion control.…”
Section: Modular Network Design
confidence: 99%
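A minimal sketch of that modular split, assuming a GRU-based observer and an MLP controller (the class names, dimensions, and layer sizes here are hypothetical illustrations, not the DOB-Net architecture itself):

```python
import torch
import torch.nn as nn

STATE_DIM, ACTION_DIM, DIST_DIM = 8, 2, 3  # assumed, for illustration

class DisturbanceObserver(nn.Module):
    """Disturbance identification module: maps past motion history
    to an estimate of the current (unobservable) disturbance."""
    def __init__(self):
        super().__init__()
        self.rnn = nn.GRU(STATE_DIM + ACTION_DIM, 32, batch_first=True)
        self.head = nn.Linear(32, DIST_DIM)

    def forward(self, history):  # history: (batch, T, STATE_DIM + ACTION_DIM)
        _, h = self.rnn(history)
        return self.head(h[-1])  # disturbance estimate

class Controller(nn.Module):
    """Motion control module: maps the current state plus the
    observer's estimate to a control action."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(STATE_DIM + DIST_DIM, 64),
            nn.ReLU(),
            nn.Linear(64, ACTION_DIM),
        )

    def forward(self, state, dist_est):
        return self.net(torch.cat([state, dist_est], dim=-1))
```

One appeal of this decoupling is that the observer can be trained or inspected on the disturbance-identification task independently of the control objective, which an end-to-end recurrent policy does not permit.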
“…In [16], ILC is used to generate a correction signal for the DOB to enhance disturbance attenuation when the major component of the disturbance is repetitive. In addition, neural networks have been introduced to enhance the DOB's performance [17][18][19][20]. For example, in [17], a radial basis function NN is combined with a DOB to deal with both unknown dynamics and external disturbances; in [20], the conventional DOB is enhanced via Recurrent Neural Networks for disturbance estimation and prediction.…”
Section: Introduction
confidence: 99%
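For context, the classical DOB mechanism that these works enhance can be sketched on a toy discrete-time first-order plant (the plant parameters, disturbance waveform, and filter gain below are assumptions chosen for illustration, not taken from any cited paper):

```python
import numpy as np

# Toy plant: x[k+1] = a*x[k] + b*(u[k] + d[k]), with an assumed
# sinusoidal input disturbance d that the DOB must estimate and cancel.
a, b = 0.9, 0.5
T = 200
d = 0.8 * np.sin(0.1 * np.arange(T))

x, d_hat = 0.0, 0.0
alpha = 0.5  # low-pass filter gain (the role of the classical Q-filter)
for k in range(T):
    u = -d_hat                      # feedforward cancellation of the estimate
    x_next = a * x + b * (u + d[k])
    # Invert the nominal plant to recover the total effective input,
    # subtract the commanded input, and low-pass filter the residual.
    d_raw = (x_next - a * x) / b - u
    d_hat = (1 - alpha) * d_hat + alpha * d_raw
    x = x_next
```

The `(x_next - a * x) / b` step is exactly the plant-inverse dependence criticized in the next citation: if the nominal `a` and `b` are inaccurate, the estimate degrades accordingly.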
“…Moreover, the performance of a conventional DOB depends heavily on an accurate plant inverse, which is usually unavailable or very sensitive to uncertainties; this significantly limits the DOB's performance. Recently, deep learning techniques have been developed and applied to high-level decision making (e.g., [21][22][23]) and low-level trajectory planning and tracking (e.g., [20], [24][25][26]). Since the drone delivery scenarios considered in this paper are relatively structured, here we leverage convolutional neural network (CNN) and long short-term memory (LSTM) techniques to incorporate image-based perception into the DOB framework, aiming to improve the DOB's performance.…”
Section: Introduction
confidence: 99%
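A rough sketch of such a CNN-plus-LSTM perception module (the architecture, layer sizes, and the name `PerceptionDOB` are illustrative assumptions, not the authors' implementation):

```python
import torch
import torch.nn as nn

class PerceptionDOB(nn.Module):
    """A CNN encodes each camera frame; an LSTM over the frame
    sequence predicts the disturbance acting on the vehicle."""
    def __init__(self, dist_dim=3):
        super().__init__()
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 16, 5, stride=2), nn.ReLU(),
            nn.Conv2d(16, 32, 5, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),  # -> (batch, 32)
        )
        self.lstm = nn.LSTM(32, 64, batch_first=True)
        self.head = nn.Linear(64, dist_dim)

    def forward(self, frames):  # frames: (batch, T, 3, H, W)
        b, t = frames.shape[:2]
        feats = self.cnn(frames.flatten(0, 1)).view(b, t, -1)
        _, (h, _) = self.lstm(feats)
        return self.head(h[-1])  # one disturbance prediction per sequence
```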