A new method of concurrently visualizing states, values, and actions in reinforcement based brain machine interfaces

Bae, Jihye; Giraldo, Luis G. Sánchez; Pohlmeyer, Eric A.; Sanchez, Justin C.; Príncipe, José C.

doi:10.1109/embc.2013.6610770

Cited by 1 publication

(1 citation statement)

References 6 publications


“…Using the proposed methodology introduced in [36], we can observe how the decoder effectively learns a good state to action mapping, and how neural states affect the prediction performance. Figure 15 shows how each participant (the agent and the user) influences the overall performance in both successful and missed trials, and how the agent adapts the environment.…”

Section: Experimental Results On Neural Decoding (mentioning)

confidence: 99%

Kernel Temporal Differences for Neural Decoding

Bae, Giraldo, Pohlmeyer, et al. 2015

Computational Intelligence and Neuroscience

Self Cite


We study the feasibility and capability of the kernel temporal difference (KTD)(λ) algorithm for neural decoding. KTD(λ) is an online, kernel-based learning algorithm that was introduced to estimate value functions in reinforcement learning. It combines kernel-based representations with the temporal difference approach to learning. One of our key observations is that, by using strictly positive definite kernels, the algorithm's convergence can be guaranteed for policy evaluation. The algorithm's nonlinear functional approximation capabilities are shown both in simulations of policy evaluation and in neural decoding problems (policy improvement). KTD can handle high-dimensional neural states containing spatial-temporal information at a reasonable computational complexity, allowing real-time applications. When the algorithm seeks a proper mapping between a monkey's neural states and desired positions of a computer cursor or a robot arm, in both open-loop and closed-loop experiments, it can effectively learn the neural state to action mapping. Finally, a visualization of the coadaptation process between the decoder and the subject shows the algorithm's capabilities in reinforcement learning brain machine interfaces.
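As a rough illustration of how a kernel representation can be combined with temporal difference learning for policy evaluation, the sketch below implements only the simplest case (λ = 0) on a toy problem. It is not the authors' implementation: the class name `KernelTD0`, the Gaussian kernel width, the step size, and the three-state chain are all assumptions chosen for illustration, and neural decoding (policy improvement) is not covered.

```python
import numpy as np


def gaussian_kernel(x, centers, width):
    """Gaussian kernel (strictly positive definite) between x and each center."""
    diff = centers - x
    return np.exp(-np.sum(diff * diff, axis=1) / (2.0 * width ** 2))


class KernelTD0:
    """Hypothetical minimal kernel TD learner with lambda = 0.

    The value estimate is V(x) = sum_i coeffs[i] * k(x, centers[i]).
    Every observed transition adds one kernel unit centered on the
    visited state, weighted by step_size * TD error.
    """

    def __init__(self, dim, step_size=0.3, gamma=0.9, width=0.5):
        self.centers = np.empty((0, dim))
        self.coeffs = np.empty(0)
        self.step_size = step_size
        self.gamma = gamma
        self.width = width

    def value(self, x):
        x = np.atleast_1d(np.asarray(x, dtype=float))
        if self.coeffs.size == 0:
            return 0.0
        return float(self.coeffs @ gaussian_kernel(x, self.centers, self.width))

    def update(self, x, reward, x_next, terminal):
        x = np.atleast_1d(np.asarray(x, dtype=float))
        # Bootstrapped target; a terminal successor contributes no future value.
        target = reward if terminal else reward + self.gamma * self.value(x_next)
        td_error = target - self.value(x)
        # Functional update: f <- f + step_size * td_error * k(x, .)
        self.centers = np.vstack([self.centers, x])
        self.coeffs = np.append(self.coeffs, self.step_size * td_error)
        return td_error


def evaluate_chain(episodes=200):
    """Toy deterministic chain (an assumption): 0 -> 1 -> terminal, reward 1 at the end."""
    model = KernelTD0(dim=1)
    for _ in range(episodes):
        model.update(0.0, 0.0, 1.0, terminal=False)
        model.update(1.0, 1.0, 2.0, terminal=True)
    return model


model = evaluate_chain()
print("V(0) =", model.value(0.0), "V(1) =", model.value(1.0))
```

In this sketch the number of kernel units grows by one per observed transition, so a practical version would need some form of sparsification or a fixed budget, which is left out here.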

