“…Referred publications
Markov decision process: [12, 23, 24, 37, 64, 70, 75, 84, 96, 100, 101, 104, 127, 130, 133, 138, 144, 153, 165, 167, 170, 177, 188, 191, 199, 203, 207, 211, 212, 214, 217, 220, 231, 252, 256-259, 263, 264, 272, 274, 281, 291, 309, 313, 320, 340, 343, 346, 369-376]
Multiarmed bandit: [61, 66, 102, 198, 351, 377, 378]
Dynamic programming: [16, 19, 27, 52, 68, 70, 84, 90, 93,…”