“…Scientific computation and control of the emergent/collective dynamics of high-dimensional multiscale/complex dynamical systems constitute open challenging tasks due to (a) the lack of physical insight and knowledge of the appropriate macroscopic quantities needed to usefully describe the evolution of the emergent dynamics, (b) the so-called "curse of dimensionality" when trying to efficiently learn surrogate models with good generalization properties, and (c) the problem of bridging the scale where individual units (atoms, molecules, cells, bacteria, individuals, robots) interact, and the macroscopic scale where the emergent properties arise and evolve [1][2][3][4]. For the task of identification of macroscopic variables from high-fidelity simulations/spatio-temporal data, various machine learning methods have been proposed including non-linear manifold learning algorithms such as Diffusion Maps (DMs) [5][6][7][8][9][10][11][12][13], ISOMAP [14][15][16] and Local Linear Embedding [17,18] but also Autoencoders [19,20]. For the task of the extraction of surrogate models for the approximation of the emergent dynamics, available approaches include the Sparse Identification of the Nonlinear Dynamics (SINDy) [21], the Koopman operator [22][23][24][25][26][27], Gaussian Processes [12,18,28], Artificial Neural Networks (ANNs) [12,13], Recursive Neural Networks (RNN) [20], Deep Learning [29], as well as Long Short-Term Memory (LSTM) networks [30].…”