“…When it comes to full information control [40], where the control input can depend on the disturbance, that is, the controller can use the disturbance as a feedforward signal, controllers designed in the Nash game framework cannot make full use of the disturbance information. The algorithms in existing works [35, 37–39] imply that the disturbance is known, since disturbance data are needed to train the Q‐function. However, the controller designs in these works do not make use of the disturbance information and are therefore conservative.…”