“…We can highlight state space design [12,25,33,107,144,179,193,208,217,220,222,224,227,266,267], action space design [109,220,246,268], reward construction [14,76,110,199,220,226,246,269,270,271,272,273], and exploration strategy planning [86,274], which can be determining factors from the point of view of the whole application. [11,13,17,20,21,24,38,43,61,62,66,69,82,89,93], Allocation, assignment, resource management [20,22,…”