Proceedings of the Conference Recent Advances in Natural Language Processing - Large Language Models for Natural Language Proce 2023
DOI: 10.26615/978-954-452-092-2_090

Forming Trees with Treeformers

Nilay Patel, Jeffrey Flanigan

Abstract: Human language is known to exhibit a nested, hierarchical structure, allowing us to form complex sentences out of smaller pieces. However, many state-of-the-art neural network models, such as Transformers, have no explicit hierarchical structure in their architecture; that is, they do not have an inductive bias toward hierarchical structure. Additionally, Transformers are known to perform poorly on compositional generalization tasks, which require such structures. In this paper, we introduce Treeformer, a general-…
