“…where (j|i) denotes the exogenously given probability that the state transitions from i to j, β is the discount factor, and α t+1 j (a t ) denotes the ignorance equivalent of the continuation problem that ensues under the posteriors associated with action a t . The ignorance equivalent in turn is determined from the choice probabilities in the subsequent periods, dating back to Caplin, Dean, and Leahy (2018) and generalized in Müller-Itten, Armenter, and Stangebye (2023),…”