A Maximum Entropy Deep Reinforcement Learning Neural Tracker

Balaram, Shafa; Arulkumaran, Kai; Dai, Tianhong; Bharath, Anil A.

doi:10.1007/978-3-030-32692-0_46

Cited by 2 publications

(7 citation statements)

References 47 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In our work, the encoder of CKNet is implemented separately using deterministic and variational convolutional networks (DCKNet and VCKNet), in which spectral analysis is performed to study the relationship between them on spanning the latent state space to approximate the Koopman operator. Four Mujoco cases are modelled by DCKNet and are trained with soft actor-critic (SAC) [44] simultaneously. Two offline cases in OpenAI Gym (Gym for brevity in the rest of this paper) are modelled based on pre-collected datasets.…”

Section: Synchronouslymentioning

confidence: 99%

“…For offline training, all cases are trained with DCKNet and VCKNet separately. To implement back-propagation in VCKNet, the reparameterised technique [44,55] is applied to sample observables in the training process, that is…”

Section: Training Rules For Cknetmentioning

confidence: 99%

“…For online training cases in Mujoco, soft actor-critic (SAC) [44] is adopted to provide training data for DCKNet constantly along with the entire training process. The encoder is updated synchronously with the critic and the Koopman operator, while the decoder is only updated with the Koopman operator.…”

Section: Training Rules For Cknetmentioning

confidence: 99%

“…β ⋆ means corresponding learning rates of weights ⋆, where ⋆ ∈ fzc; K; zag. Algorithm 2 points out the updating order of all neural networks, and more details of SAC refer to [44].…”

Section: Training Rules For Cknetmentioning

confidence: 99%

See 3 more Smart Citations

A deep Koopman operator‐based modelling approach for long‐term prediction of dynamics with pixel‐level measurements

Xiao

Tang

et al. 2023

CAAI Trans on Intel Tech

View full text Add to dashboard Cite

Although previous studies have made some clear leap in learning latent dynamics from high‐dimensional representations, the performances in terms of accuracy and inference time of long‐term model prediction still need to be improved. In this study, a deep convolutional network based on the Koopman operator (CKNet) is proposed to model non‐linear systems with pixel‐level measurements for long‐term prediction. CKNet adopts an autoencoder network architecture, consisting of an encoder to generate latent states and a linear dynamical model (i.e., the Koopman operator) which evolves in the latent state space spanned by the encoder. The decoder is used to recover images from latent states. According to a multi‐step ahead prediction loss function, the system matrices for approximating the Koopman operator are trained synchronously with the autoencoder in a mini‐batch manner. In this manner, gradients can be synchronously transmitted to both the system matrices and the autoencoder to help the encoder self‐adaptively tune the latent state space in the training process, and the resulting model is time‐invariant in the latent space. Therefore, the proposed CKNet has the advantages of less inference time and high accuracy for long‐term prediction. Experiments are performed on OpenAI Gym and Mujoco environments, including two and four non‐linear forced dynamical systems with continuous action spaces. The experimental results show that CKNet has strong long‐term prediction capabilities with sufficient precision.

show abstract

Section: Synchronouslymentioning

confidence: 99%

Section: Training Rules For Cknetmentioning

confidence: 99%

Section: Training Rules For Cknetmentioning

confidence: 99%

Section: Training Rules For Cknetmentioning

confidence: 99%

See 2 more Smart Citations

A deep Koopman operator‐based modelling approach for long‐term prediction of dynamics with pixel‐level measurements

Xiao

Tang

et al. 2023

CAAI Trans on Intel Tech

View full text Add to dashboard Cite

show abstract

“…This approach was further improved by PNR ( Radojević and Meijering, 2017b , 2019 ) and PAT ( Skibbe et al , 2019 ) using Monte Carlo filtering. Zhang et al (2018) , Dai et al (2019) and Balaram et al (2019) reformulated the tracing as a behavior problem and introduced a deep reinforcement learning strategy to guide the tracing process. Athey et al (2022) connected the broken components traced by the Bayesian appearance imaging model employing a hidden Markov model.…”

Section: Automatic Tracing Algorithmsmentioning

confidence: 99%

Neuron tracing from light microscopy images: automation, deep learning and bench testing

et al. 2022

View full text Add to dashboard Cite

Motivation Large-scale neuronal morphologies are essential to neuronal typing, connectivity characterization and brain modeling. It is widely accepted that automation is critical to the production of neuronal morphology. Despite previous survey papers about neuron tracing from light microscopy data in the last decade, thanks to the rapid development of the field, there is a need to update recent progress in a review focusing on new methods and remarkable applications. Results This review outlines neuron tracing in various scenarios with the goal to help the community understand and navigate tools and resources. We describe the status, examples, and accessibility of automatic neuron tracing. We survey recent advances of the increasingly popular deep learning enhanced methods. We highlight the semi-automatic methods for single neuron tracing of mammalian whole brains as well as the resulting datasets, each containing thousands of full neuron morphologies. Finally, we exemplify the commonly used datasets and metrics for neuron tracing bench testing.

show abstract

A Maximum Entropy Deep Reinforcement Learning Neural Tracker

Cited by 2 publications

References 47 publications

A deep Koopman operator‐based modelling approach for long‐term prediction of dynamics with pixel‐level measurements

A deep Koopman operator‐based modelling approach for long‐term prediction of dynamics with pixel‐level measurements

Neuron tracing from light microscopy images: automation, deep learning and bench testing

Contact Info

Product

Resources

About