2018
DOI: 10.2514/1.i010553
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive Simulation-Based Training of Artificial-Intelligence Decision Makers Using Bayesian Optimization

Abstract: This work studies how an AI-controlled dog-fighting agent with tunable decisionmaking parameters can learn to optimize performance against an intelligent adversary, as measured by a stochastic objective function evaluated on simulated combat engagements. Gaussian process Bayesian optimization (GPBO) techniques are developed to automatically learn global Gaussian Process (GP) surrogate models, which provide statistical performance predictions in both explored and unexplored areas of the parameter space. This al… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(8 citation statements)
references
References 29 publications
0
8
0
Order By: Relevance
“…To serve as a credible adversary, the AI agent must determine the human participant's skill level and then adapt accordingly (Israelsen et al, 2018). For this purpose, the AI agent can be trained in an interactive environment against an intelligent (AI) adversary to learn how to optimize performance.…”
Section: Armed Forces Trainingmentioning
confidence: 99%
See 1 more Smart Citation
“…To serve as a credible adversary, the AI agent must determine the human participant's skill level and then adapt accordingly (Israelsen et al, 2018). For this purpose, the AI agent can be trained in an interactive environment against an intelligent (AI) adversary to learn how to optimize performance.…”
Section: Armed Forces Trainingmentioning
confidence: 99%
“…For this purpose, the AI agent can be trained in an interactive environment against an intelligent (AI) adversary to learn how to optimize performance. For this AI-AI interaction in an interactive virtual environment, Gaussian process Bayesian optimization techniques have been used to optimize the AI agent's performance (Israelsen et al, 2018).…”
Section: Armed Forces Trainingmentioning
confidence: 99%
“…Furthermore, the best local optimum may be undesirable for learning with sparse data early on in the GPBO process, since the associated Θ values typically overfit the training data [7], [4]. This behavior is especially important to consider when trying to minimize the number of simulations for GPBO [8].…”
Section: A Bayesian Optimization Theorymentioning
confidence: 99%
“…Note that the use of a non-convex optimization technique like DIRECT makes sense here, since they key idea behind Bayesian optimization is that evaluation of a(q) at multiple test points q will be cheaper and faster than evaluating y at those points directly. In this work, we use the classical approach of selecting a single new design point q on each iteration of GPBO, although variations to sample multiple design points at once or repeatedly on each iteration are also possible [8].…”
Section: A Bayesian Optimization Theorymentioning
confidence: 99%
“…That is, BVR air combat emphasizes the distance game and the key of evasive maneuvers in these situations is tactical planning. The second is "dogfight", in which the main features are relatively close distance, high-dynamic, and intense confrontation [4]. There is more of an emphasis on the game of space angle.…”
Section: Introductionmentioning
confidence: 99%