“…where $J_{\pi_{R,\mu}}$ is the average number of command actions under policy $\pi_{R,\mu}$, which is calculated using (17). From (17) and the fact that (P3) is decoupled into $K$ per-sensor problems (P4), $J_{\pi_{R,\mu}}$ is calculated as $J_{\pi_{R,\mu}} = \frac{1}{K} \sum_{k=1}^{K} J_{\pi_{R,\mu,k}}$, where $J_{\pi_{R,\mu,k}}$ denotes the per-sensor time-average number of command actions under the per-sensor policy $\pi_{R,\mu,k}$, which is defined as…”
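
The averaging step in the excerpt can be illustrated with a minimal sketch. It only shows the final aggregation, the mean of $K$ per-sensor averages, and does not reproduce the elided per-sensor definition or equation (17). The function name `average_command_actions` and the numeric values are hypothetical and chosen purely for illustration.

```python
from typing import Sequence


def average_command_actions(per_sensor_averages: Sequence[float]) -> float:
    """Return (1/K) * sum over k of the per-sensor averages.

    Each entry is assumed to already be the time-average number of
    command actions for one sensor under its per-sensor policy.
    """
    num_sensors = len(per_sensor_averages)  # K in the excerpt
    if num_sensors == 0:
        raise ValueError("at least one sensor is required")
    return sum(per_sensor_averages) / num_sensors


# Hypothetical per-sensor time averages (illustrative values, not from the paper).
per_sensor = [0.25, 0.5, 0.75]
overall = average_command_actions(per_sensor)
print(overall)
```

Because the problem decouples across sensors, each per-sensor average can be computed independently and the aggregate obtained in a single pass like this.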