“…A group of learning automata can cooperate to cope with many hard-to-solve problems. To name just a few, learning automata have a wide variety of applications in combinatorial optimization problems [34][35][36], computer networks [37][38][39][40][41], queuing theory [42], signal processing [43], information retrieval [44], adaptive control [45], and pattern recognition [46]. The environment can be described by a triple E : {a, b, c}, where a : {a 1 , a 2 , …, a r } represents the finite set of the inputs, b : {b 1 , b 2 , …, b m } denotes the set of the values that can be taken by the reinforcement signal, and c : {c 1 , c 2 , …, c r } denotes the set of the penalty probabilities, where the element c i is associated with the given action a i .…”