Generalizable control for quantum parameter estimation through reinforcement learning

Xu, Haifeng; Li, Junning; Liu, Liqiang; Wang, Yu; Yuan, Haidong; Wang, Xin

doi:10.1038/s41534-019-0198-z

Cited by 102 publications

(62 citation statements)

References 49 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Quantum discrimination and quantum estimation underlie many applications in quantum information science, including quantum hypothesis testing, quantum detection, and quantum sensing. While quantum control has been employed to improve the precision in quantum estimation [47][48][49][50][51][52][53][54][55][56], the use of quantum control in quantum discrimination remains scarce [57][58][59]. This is so despite the fact that one may expect quantum control to help identify fundamental performance bounds of quantum discrimination, similar to those found for quantum computation [6,60] or derive pulse shapes for improved performance with direct relevance to experiments [8,61].…”

Section: Introductionmentioning

confidence: 99%

Optimally controlled quantum discrimination and estimation

Basilewitsch

Yuan

Koch

2020

Phys. Rev. Research

Self Cite

View full text Add to dashboard Cite

Quantum discrimination and estimation are pivotal for many quantum technologies, and their performance depends on the optimal choice of probe state and measurement. Here we show that their performance can be further improved by suitably tailoring the pulses that make up the interferometer. Developing an optimal control framework and applying it to the discrimination and estimation of a magnetic field in the presence of noise, we find an increase in the overall achievable state distinguishability. Moreover, the maximum distinguishability can be stabilized for times that are more than an order of magnitude longer than the decoherence time.

show abstract

Section: Introductionmentioning

confidence: 99%

Optimally controlled quantum discrimination and estimation

Basilewitsch

Yuan

Koch

2020

Phys. Rev. Research

Self Cite

View full text Add to dashboard Cite

show abstract

“…A shallow depth may broaden exploration, a strategy typically found in Reinforcement Learning (RL) [30]. This has been powerfully combined with Deep Neural Networks (DNN) [31][32][33][34][35] and applied recently to quantum systems [36][37][38][39][40][41][42][43]. Unfortunately, single-step lookaheads are inherently local and thus require a slower learning rate, with no performance gain found over full-depth, domain-specialized (Hessian approximation) methods in QOCT.…”

mentioning

confidence: 99%

Global optimization of quantum dynamics with AlphaZero deep exploration

et al. 2020

View full text Add to dashboard Cite

While a large number of algorithms for optimizing quantum dynamics for different objectives have been developed, a common limitation is the reliance on good initial guesses, being either random or based on heuristics and intuitions. Here we implement a tabula rasa deep quantum exploration version of the Deepmind AlphaZero algorithm for systematically averting this limitation. AlphaZero employs a deep neural network in conjunction with deep lookahead in a guided tree search, which allows for predictive hidden variable approximation of the quantum parameter landscape. To emphasize transferability, we apply and benchmark the algorithm on three classes of control problems using only a single common set of algorithmic hyperparameters. AlphaZero achieves substantial improvements in both the quality and quantity of good solution clusters compared to earlier methods. It is able to spontaneously learn unexpected hidden structure and global symmetry in the solutions, going beyond even human heuristics. arXiv:1907.05672v1 [quant-ph]

show abstract

“…Also in the regime of local parameter estimation, where the parameter is already known to high precision (typically from previous measurements), actor-critic and proximal-policy-optimization RL algorithms were used to find policies to control the dynamics of quantum sensors [30][31][32]. There, the estimation of the precession frequency of a dissipative spin-1 2 particle was improved by adding a linear control to the dynamics in form of an additional controlled magnetic field [32].…”

Section: Introductionmentioning

confidence: 99%

Improving the dynamics of quantum sensors with reinforcement learning

2020

View full text Add to dashboard Cite

Recently proposed quantum-chaotic sensors achieve quantum enhancements in measurement precision by applying nonlinear control pulses to the dynamics of the quantum sensor while using classical initial states that are easy to prepare. Here, we use the cross-entropy method of reinforcement learning (RL) to optimize the strength and position of control pulses. Compared to the quantumchaotic sensors with periodic control pulses in the presence of superradiant damping, we find that decoherence can be fought even better and measurement precision can be enhanced further by optimizing the control. In some examples, we find enhancements in sensitivity by more than an order of magnitude. By visualizing the evolution of the quantum state, the mechanism exploited by the RL method is identified as a kind of spin-squeezing strategy that is adapted to the superradiant damping.

show abstract

Generalizable control for quantum parameter estimation through reinforcement learning

Cited by 102 publications

References 49 publications

Optimally controlled quantum discrimination and estimation

Optimally controlled quantum discrimination and estimation

Global optimization of quantum dynamics with AlphaZero deep exploration

Improving the dynamics of quantum sensors with reinforcement learning

Contact Info

Product

Resources

About