A Neural Temporal Model for Human Motion Prediction

Gopalakrishnan, Anand; Mali, Ankur; Kifer, Daniel; Giles, C. Lee; Ororbia, Alexander G.

doi:10.1109/cvpr.2019.01239

Cited by 162 publications

(94 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…state-of-the-art performance in many important tasks that are sequential in nature. These tasks range from those in statistical machine translation [6], to language modeling and text processing [7], [8], [9], to long and short-term human motion generation [10], to speech recognition [11]. To train these powerful recurrent networks, back-propagation through time (BPTT) has long been the the primary algorithm of choice for computing parameter updates.…”

Section: Introductionmentioning

confidence: 99%

Continual Learning of Recurrent Neural Networks by Locally Aligning Distributed Representations

Ororbia

Mali

Giles

et al. 2020

IEEE Trans. Neural Netw. Learning Syst.

Self Cite

View full text Add to dashboard Cite

Temporal models based on recurrent neural networks have proven to be quite powerful in a wide variety of applications, including language modeling and speech processing. However, training these models often relies on back-propagation through time, which entails unfolding the network over many time steps, making the process of conducting credit assignment considerably more challenging. Furthermore, the nature of backpropagation itself does not permit the use of non-differentiable activation functions and is inherently sequential, making parallelization of the underlying training process difficult.Here, we propose the Parallel Temporal Neural Coding Network (P-TNCN), a biologically inspired model trained by the learning algorithm we call Local Representation Alignment. It aims to resolve the difficulties and problems that plague recurrent networks trained by back-propagation through time. The architecture requires neither unrolling in time nor the derivatives of its internal activation functions. We compare our model and learning procedure to other back-propagation through time alternatives (which also tend to be computationally expensive), including real-time recurrent learning, echo state networks, and unbiased online recurrent optimization. We show that it outperforms these on sequence modeling benchmarks such as Bouncing MNIST, a new benchmark we denote as Bouncing NotMNIST, and Penn Treebank. Notably, our approach can in some instances outperform full back-propagation through time as well as variants such as sparse attentive back-tracking.Significantly, the hidden unit correction phase of P-TNCN allows it to adapt to new datasets even if its synaptic weights are held fixed (zero-shot adaptation) and facilitates retention of prior generative knowledge when faced with a task sequence. We present results that show the P-TNCN's ability to conduct zero-shot adaptation and online continual sequence modeling.

show abstract

Section: Introductionmentioning

confidence: 99%

Continual Learning of Recurrent Neural Networks by Locally Aligning Distributed Representations

Ororbia

Mali

Giles

et al. 2020

IEEE Trans. Neural Netw. Learning Syst.

Self Cite

View full text Add to dashboard Cite

show abstract

“…To include action label information, we concatenate a onehot encoded action type vector with each pose, similar to recent literature [6], [8], [21]. With the action label and human motion learned by our autoencoder, this knowledge can be used to solve the action classification task.…”

Section: Action Classification and Label Recoverymentioning

confidence: 99%

“…We follow the same evaluation method for short-term prediction as in [4]- [8], [10], [11], [21]. We cite the results from the most relevant works to compare with our method, which are Res-Seq2Seq [6], the model by Tang et al [11], VGRU-rl [21] and AGED [8] which is the current state of the art. We also compare against the naive zero-velocity baseline proposed by [6] and use their code to generate long-term predictions.…”

Section: A Baselinesmentioning

confidence: 99%

“…We use two normalization methods depending on the baseline that we are comparing against: (1) subtract the mean and normalized between -1 and 1, which is used in [11], and (2) subtract the mean and divide by the standard deviation, which is used by the other methods [6], [8], [21]. Table I: Comparison of mean angle error between our method and top performing baselines for short-term motion prediction.…”

Section: B Data Preprocessingmentioning

confidence: 99%

See 1 more Smart Citation

Human Motion Prediction Via Pattern Completion in Latent Representation Space

Meger

2019

2019 16th Conference on Computer and Robot Vision (CRV)

View full text Add to dashboard Cite

Inspired by ideas in cognitive science, we propose a novel and general approach to solve human motion understanding via pattern completion on a learned latent representation space. Our model outperforms current state-of-the-art methods in human motion prediction across a number of tasks, with no customization. To construct a latent representation for time-series of various lengths, we propose a new and generic autoencoder based on sequence-to-sequence learning. While traditional inference strategies find a correlation between an input and an output, we use pattern completion, which views the input as a partial pattern and to predict the best corresponding complete pattern. Our results demonstrate that this approach has advantages when combined with our autoencoder in solving human motion prediction, motion generation and action classification.

show abstract

“…To preserve the motion trend in long-term prediction, we propose a bi-directional predictor enhanced by a bidiscriminator to adversarially revise the generated forward and backward motion dynamics. From a single forward predictor [8], [10], [17], prediction errors are rapidly accumulated along the temporal domain since RNN models fail to keep the long-term knowledge in recurrent steps, causing the generated motion drifting to a wrong direction. To this end, we train a backward predictor to encode the velocity in reversed timesteps, such that the model recovers the context from the beginning dynamics that are lost during long sequence transition.…”

Section: Introductionmentioning

confidence: 99%

A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction

Men

Shum

et al. 2021

IEEE Trans. Circuits Syst. Video Technol.

View full text Add to dashboard Cite

Recurrent neural network (RNN) has become popular for human motion prediction thanks to its ability to capture temporal dependencies. However, it has limited capacity in modeling the complex spatial relationship in the human skeletal structure. In this work, we present a novel diffusion convolutional recurrent predictor for spatial and temporal movement forecasting, with multi-step random walks traversing bidirectionally along an adaptive graph to model interdependency among body joints. In the temporal domain, existing methods rely on a single forward predictor with the produced motion deflecting to the drift route, which leads to error accumulations over time. We propose to supplement the forward predictor with a forward discriminator to alleviate such motion drift in the long term under adversarial training. The solution is further enhanced by a backward predictor and a backward discriminator to effectively reduce the error, such that the system can also look into the past to improve the prediction at early frames. The two-way spatial diffusion convolutions and two-way temporal predictors together form a quadruple network. Furthermore, we train our framework by modeling the velocity from observed motion dynamics instead of static poses to predict future movements that effectively reduces the discontinuity problem at early prediction. Our method outperforms the state of the arts on both 3D and 2D datasets, including the Human3.6M, CMU Motion Capture and Penn Action datasets. The results also show that our method correctly predicts both high-dynamic and low-dynamic moving trends with less motion drift.

show abstract

A Neural Temporal Model for Human Motion Prediction

Cited by 162 publications

References 21 publications

Continual Learning of Recurrent Neural Networks by Locally Aligning Distributed Representations

Continual Learning of Recurrent Neural Networks by Locally Aligning Distributed Representations

Human Motion Prediction Via Pattern Completion in Latent Representation Space

A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction

Contact Info

Product

Resources

About