“…Examples of the latter type include algorithms employing various pruning heuristics [Mon82, CLZ97, HF00a, Hau00, RG02, TK03, PB04, SV05, PGT06, RPPCD08, SV10, SS12, SYHL13, GHL19] and algorithms which optimize over restricted classes of policies [Han98, MKKC99, KMN99, LYX11, AYA18]. To our knowledge, some of the only works presenting subexponential time approximate planning algorithms are [BDRS96] and [MY20, KY20] (ignoring end-to-end learning algorithms, which we discuss later).…”