2022 IEEE Symposium Series on Computational Intelligence (SSCI) 2022
DOI: 10.1109/ssci51031.2022.10022262
|View full text |Cite
|
Sign up to set email alerts
|

Periodic Extrapolative Generalisation in Neural Networks

Abstract: We break the linear link between the layer size and its inference cost by introducing the fast feedforward (FFF) architecture, a log-time alternative to feedforward networks. We show that FFFs give comparable performance to feedforward networks at an exponential fraction of their inference cost, are quicker to deliver performance compared to mixtureof-expert networks, and can readily take the place of either in transformers. Pushing FFFs to the absolute limit, we train a vision transformer to perform single-ne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 19 publications
0
0
0
Order By: Relevance