2016
DOI: 10.48550/arxiv.1612.05695
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Reinforcement Learning Using Quantum Boltzmann Machines

Abstract: We investigate whether quantum annealers with select chip layouts can outperform classical computers in reinforcement learning tasks. We associate a transverse field Ising spin Hamiltonian with a layout of qubits similar to that of a deep Boltzmann machine (DBM) and use simulated quantum annealing (SQA) to numerically simulate quantum sampling from this system. We design a reinforcement learning algorithm in which the set of visible nodes representing the states and actions of an optimal policy are the first a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
27
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 25 publications
(27 citation statements)
references
References 39 publications
0
27
0
Order By: Relevance
“…A large number of applications have been modeled in the quadratic case. For example, in combinatorial scientific computing [21][22][23][24], chemistry [25,26], and machine learning [27][28][29] In the following subsections, we give examples that consist of problems modeled as a HOBO with higher-order terms greater than two.…”
Section: Applicationsmentioning
confidence: 99%
“…A large number of applications have been modeled in the quadratic case. For example, in combinatorial scientific computing [21][22][23][24], chemistry [25,26], and machine learning [27][28][29] In the following subsections, we give examples that consist of problems modeled as a HOBO with higher-order terms greater than two.…”
Section: Applicationsmentioning
confidence: 99%
“…With these new chips having more couplers between the qubits, we will be able to embed shapes with more elements and hopefully determine smoother geometries. We will continue to focus on laying the foundation for solving practically relevant problems by means of quantum computing, quantum simulation, and quantum optimization [16,17,[28][29][30][31][32][33].…”
Section: Future Workmentioning
confidence: 99%
“…While D-Wave machines were designed to be solvers and not samplers, the operating principle of AQC suggests that those machines could be efficiently used to sample from Boltzmann distributions. In fact, Boltzmann sampling using a D-wave machine has already been investigated in the existing literature [4], [5], [6], where the authors aim to train Boltzmann Machines. Yet, the implemented protocol allowing to perform such sampling is usually difficult to reproduce.…”
Section: Introductionmentioning
confidence: 99%