2022
DOI: 10.1038/s41586-022-05172-4

Discovering faster matrix multiplication algorithms with reinforcement learning

Abstract: Improving the efficiency of algorithms for fundamental computations can have a widespread impact, as it can affect the overall speed of a large amount of computations. Matrix multiplication is one such primitive task, occurring in many systems—from neural networks to scientific computing routines. The automatic discovery of algorithms using machine learning offers the prospect of reaching beyond human intuition and outperforming the current best human-designed algorithms. However, automating the algorithm disc…
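
The abstract concerns algorithms that multiply matrices using fewer scalar multiplications than the schoolbook method. As an illustrative sketch only (not code from the paper), the snippet below implements Strassen's classical recursion, the best-known decomposition-based scheme of this kind and the family of algorithms the reinforcement-learning search generalizes; the function name strassen, the cutoff parameter and the power-of-two size restriction are assumptions made for brevity.

```python
# Minimal sketch of Strassen's algorithm: 7 block products instead of 8,
# giving an O(n^2.81) recursion rather than O(n^3).
import numpy as np

def strassen(A, B, cutoff=64):
    """Multiply square matrices whose side length is a power of two."""
    n = A.shape[0]
    if n <= cutoff:                      # fall back to ordinary multiplication
        return A @ B
    h = n // 2
    a11, a12 = A[:h, :h], A[:h, h:]
    a21, a22 = A[h:, :h], A[h:, h:]
    b11, b12 = B[:h, :h], B[:h, h:]
    b21, b22 = B[h:, :h], B[h:, h:]

    # Strassen's 7 recursive block products
    m1 = strassen(a11 + a22, b11 + b22, cutoff)
    m2 = strassen(a21 + a22, b11, cutoff)
    m3 = strassen(a11, b12 - b22, cutoff)
    m4 = strassen(a22, b21 - b11, cutoff)
    m5 = strassen(a11 + a12, b22, cutoff)
    m6 = strassen(a21 - a11, b11 + b12, cutoff)
    m7 = strassen(a12 - a22, b21 + b22, cutoff)

    # Reassemble the four blocks of the product
    c11 = m1 + m4 - m5 + m7
    c12 = m3 + m5
    c21 = m2 + m4
    c22 = m1 - m2 + m3 + m6
    return np.block([[c11, c12], [c21, c22]])

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A = rng.standard_normal((256, 256))
    B = rng.standard_normal((256, 256))
    assert np.allclose(strassen(A, B), A @ B)
```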

Cited by 275 publications (142 citation statements). References 39 publications.
“…The development of mathematical proofs and arguments seems to be one of the most difficult challenges. Nevertheless, some barriers have already fallen with the discovery of new multiplication algorithms [5].…”
Section: Discussion (mentioning)
confidence: 99%
“…Finally, we have to note that our scheme is based on computer-aided efficient search procedures for local relation enumerations. However, we are also actively looking for systematic redundancy generation schemes for recursive matrix multiplication methods as well as deep learning techniques recently fueled by DeepMind initiatives for faster matrix multiplications [18].…”
Section: B Proposed Methods (mentioning)
confidence: 99%
“…In the past decade, machine learning has drawn great attention from almost all natural science and engineering communities, such as mathematics [1][2][3], physics [4][5][6][7][8][9][10], biology [11][12][13], and materials sciences [14][15][16], and has been widely used in various aspects of modern society, e.g., automatic driving systems, face recognition, fraud detection, expert recommendation system, speech enhancement, and natural language processing, etc. Especially, the deep learning techniques based on the artificial neural networks [17,18] have become the most popular and dominant machine learning approaches progressively, and their interactions with many-body physics have been intensively explored in recent years.…”
Section: Introduction (mentioning)
confidence: 99%
“…Similarly, the 2nd scale transformation is composed of two isometries U^(2), and each U^(2) maps {σ^(1)} to variables {σ^(2)} sitting on the red lines. U^(3) constitutes the last scale transformation, and maps {σ^(2)} to variables {σ^(3)} sitting on the black lines. Eventually, the operator is represented in terms of {σ^(3)}, and this completes the full RG process.…”
Section: Introduction (mentioning)
confidence: 99%
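
As a reading aid for the excerpt above: a common convention for isometric coarse-graining (assumed here, not taken from the cited work, whose specific network of red and black lines is not reproduced) is that each isometry maps one layer of variables to the next and re-expresses operators on the coarser degrees of freedom:

\[
\{\sigma^{(1)}\}\ \xrightarrow{\,U^{(2)}\,}\ \{\sigma^{(2)}\}\ \xrightarrow{\,U^{(3)}\,}\ \{\sigma^{(3)}\},
\qquad
U^{(k)\dagger}U^{(k)} = \mathbb{1},
\qquad
O^{(k)} = U^{(k)\dagger}\, O^{(k-1)}\, U^{(k)} .
\]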