1990
DOI: 10.1109/5.58337
|View full text |Cite
|
Sign up to set email alerts
|

Backpropagation through time: what it does and how to do it

Abstract: Backpropagation is now the most widely used tool in the field

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

4
1,722
0
38

Year Published

1998
1998
2016
2016

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 4,012 publications
(1,764 citation statements)
references
References 11 publications
4
1,722
0
38
Order By: Relevance
“…(7) by using BPTT (Werbos, 1990). BPTT efficiently calculates the gradient of J( x 0 , w) with respect to the weight vector of the action network, w, for a given trajectory with arbitrary initial state x 0 .…”
Section: Backpropagation Through Time Algorithmmentioning
confidence: 99%
See 3 more Smart Citations
“…(7) by using BPTT (Werbos, 1990). BPTT efficiently calculates the gradient of J( x 0 , w) with respect to the weight vector of the action network, w, for a given trajectory with arbitrary initial state x 0 .…”
Section: Backpropagation Through Time Algorithmmentioning
confidence: 99%
“…These variables hold the "ordered partial derivatives" of J with respect to the given variable name, so that for example J x k = ∂ + J ∂ x k . This ordered partial derivative, as defined by Werbos (Werbos, 1990;Werbos et al, 1992), represents the derivative of J with respect to x k , assuming all other variables which depend upon x k in lines 3-7 of Alg. 1 are not fixed, and thus their derivatives will influence the value of ∂ + J ∂ x k via the chain rule.…”
Section: Backpropagation Through Time Algorithmmentioning
confidence: 99%
See 2 more Smart Citations
“…To update the model parameters, we have used Truncated Back-Propagation-Through-Time (T-BPTT) (Werbos, 1990) (Werbos, 1990) with stochastic gradient descent. We fixed the depth of BPTT to 7 for all the models.…”
Section: Implementation Detailsmentioning
confidence: 99%