“…A major benefit of learning such a safe action model is that any plan generated with the learned action model for any problem in the same domain is also valid with respect to the real, unknown, action model. Following prior works on learning action models (Amir and Chang 2008;Cresswell, McCluskey, and West 2013;Aineto, Jiménez, and Onaindia 2018) in general and safe action models in particular (Stern and Juba 2017;Juba, Le, and Stern 2021;Mordoch, Stern, and Juba 2023), we assume as input a set of observations of previously executed plans, represented as a set of trajectories. A trajectory T = ⟨s 0 , a 1 , s 1 , .…”