“…Initial attempts to imitate hippocampal episodic control (Lengyel & Dayan, 2007) in AI agents used non-parametric (Blundell et al, 2016) and semi-parametric (Pritzel et al, 2017) memories (which store state information from every timestep), enabling rapid learning when compared to standard deep RL algorithms. Scaling such memory is an open challenge, with possible solutions including more sophisticated storage/forgetting mechanisms, compression (Agostinelli, Arulkumaran, Sarrico, Richemond, & Bharath, 2019), and hierarchy (Lampinen et al, 2021). Still, these algorithms only correspond to replay, while combining episodic memories with generative models could lead to further abilities, such as planning to find outcomes that are of specific relevance to the agent (Zakharov et al, 2021).…”