“…The regret is defined as r n (τ, p (i) ) = E w(X * i ) − w(X * i,j,n,τ ) , where X * i represents the true maximizer of population p (i) , and X * i,j,n,τ the maximizer in D (i) n,j according to an estimator τ = {ŵ,ŵ 0 ,ŵ0,ŵ0}, for which we use exhaustive search to obtain. 4 The expected value is with respect to j ∈ [1,500]. We average regrets across the different p (i) to obtain r n (τ, P [u,v] [a,b] ), e.g., r n (τ, P [2,3] [0,0.5] ) would be the average regret of estimator τ across all p (i) ∈ P 3 [0,0.5] and p (i) ∈ P 4 [0,0.5] .…”