Knowledge of the hidden factors that determine particular system dynamics is crucial for both explaining them and pursuing goal-directed, interventional actions. The inference of these factors without supervision given time series data remains an open challenge. Here, we focus on spatio-temporal processes, including wave propagations and weather dynamics, and assume that universal causes (e.g. physics) apply throughout space and time. We apply a novel DIstributed, Spatio-Temporal graph Artificial Neural network Architecture, DISTANA, which learns a generative model in such domains. DISTANA requires fewer parameters, and yields more accurate predictions than temporal convolutional neural networks and other related approaches on a 2D circular wave prediction task. We show that DISTANA, when combined with a retrospective latent state inference principle called active tuning, can reliably derive hidden local causal factors. In a current weather prediction benchmark, DISTANA infers our planet's land-sea mask solely by observing temperature dynamics and uses the self inferred information to improve its own prediction of temperature. We are convinced that the retrospective inference of latent states in generative RNN architectures will play an essential role in future research on causal inference and explainable systems.
InroductionWhen considering our planet's weather, centuries of past research have identified a large number of factors that affect its highly nonlinear and partially chaotic dynamics. Yet, can we ever be sure of having identified all hidden causal factors? Moreover, do we have (sufficient) data about them? These are fundamental questions in any prediction or forecasting task, including other spatio-temporal tasks such as soil property dynamics, traffic forecasting, energy-flow prediction (e.g in brains or supply networks), or recommender systems. Here we investigate how unobservable hidden causes may be inferred from spatio-temporal data streams.