2021
DOI: 10.1007/978-3-030-91581-0_34

Extending Transformer Decoder with Working Memory for Sequence to Sequence Tasks

Cited by 2 publications (1 citation statement)
References 4 publications
“…In the working memory implementation (Sagirova & Burtsev, 2022), memory is represented by M additional tokens in the decoder input. The Transformer decoder generates, stores, and retrieves the M working memory tokens in the same way it predicts the translation sequence.…”
Section: MemTransformer, MemCtrl
confidence: 99%
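The cited statement describes the mechanism concretely: M extra tokens are placed in the decoder input, and the decoder generates, stores, and retrieves them with the same autoregressive process it uses for translation tokens. A minimal PyTorch sketch of that idea follows; it is an illustration under stated assumptions, not the authors' implementation, and the class name, the num_memory_tokens parameter, and the default M=10 are hypothetical choices.

```python
import torch
import torch.nn as nn


class WorkingMemoryDecoder(nn.Module):
    """Transformer decoder whose input is extended with M working-memory
    tokens that are generated and retrieved by the same autoregressive
    process that produces the translation (a sketch, not the paper's code)."""

    def __init__(self, vocab_size, d_model=512, nhead=8,
                 num_layers=6, num_memory_tokens=10):
        super().__init__()
        self.num_memory_tokens = num_memory_tokens  # M, an assumed default
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerDecoderLayer(d_model, nhead, batch_first=True)
        self.decoder = nn.TransformerDecoder(layer, num_layers)
        self.proj = nn.Linear(d_model, vocab_size)

    def forward(self, tgt_tokens, encoder_out):
        # tgt_tokens: (batch, T) target-side ids whose first
        # num_memory_tokens positions hold memory tokens drawn from the
        # ordinary vocabulary; positional encodings omitted for brevity.
        x = self.embed(tgt_tokens)
        T = x.size(1)
        # Standard causal mask: memory tokens are predicted left to right
        # like any other token, and later positions retrieve them through
        # ordinary self-attention.
        mask = torch.triu(
            torch.full((T, T), float("-inf"), device=x.device), diagonal=1)
        h = self.decoder(x, encoder_out, tgt_mask=mask)
        return self.proj(h)  # logits over the shared vocabulary
```

Under this reading, inference would first emit the M memory tokens and only then the translation; because memory and output share one sequence, storage and retrieval need no machinery beyond standard decoder self-attention.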