2019
DOI: 10.1007/978-3-030-32692-0_46
|View full text |Cite
|
Sign up to set email alerts
|

A Maximum Entropy Deep Reinforcement Learning Neural Tracker

Abstract: Tracking of anatomical structures has multiple applications in the field of biomedical imaging, including screening, diagnosing and monitoring the evolution of pathologies. Semi-automated tracking of elongated structures has been previously formulated as a task for deep reinforcement learning (DRL), albeit it remains a challenge. We introduce a maximum entropy continuous-action DRL neural tracker capable of training from scratch in a complex environment in the presence of high noise levels, Gaussian blurring a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(7 citation statements)
references
References 47 publications
0
7
0
Order By: Relevance
“…In our work, the encoder of CKNet is implemented separately using deterministic and variational convolutional networks (DCKNet and VCKNet), in which spectral analysis is performed to study the relationship between them on spanning the latent state space to approximate the Koopman operator. Four Mujoco cases are modelled by DCKNet and are trained with soft actor-critic (SAC) [44] simultaneously. Two offline cases in OpenAI Gym (Gym for brevity in the rest of this paper) are modelled based on pre-collected datasets.…”
Section: Synchronouslymentioning
confidence: 99%
See 3 more Smart Citations
“…In our work, the encoder of CKNet is implemented separately using deterministic and variational convolutional networks (DCKNet and VCKNet), in which spectral analysis is performed to study the relationship between them on spanning the latent state space to approximate the Koopman operator. Four Mujoco cases are modelled by DCKNet and are trained with soft actor-critic (SAC) [44] simultaneously. Two offline cases in OpenAI Gym (Gym for brevity in the rest of this paper) are modelled based on pre-collected datasets.…”
Section: Synchronouslymentioning
confidence: 99%
“…For offline training, all cases are trained with DCKNet and VCKNet separately. To implement back-propagation in VCKNet, the reparameterised technique [44,55] is applied to sample observables in the training process, that is…”
Section: Training Rules For Cknetmentioning
confidence: 99%
See 2 more Smart Citations
“…This approach was further improved by PNR ( Radojević and Meijering, 2017b , 2019 ) and PAT ( Skibbe et al , 2019 ) using Monte Carlo filtering. Zhang et al (2018) , Dai et al (2019) and Balaram et al (2019) reformulated the tracing as a behavior problem and introduced a deep reinforcement learning strategy to guide the tracing process. Athey et al (2022) connected the broken components traced by the Bayesian appearance imaging model employing a hidden Markov model.…”
Section: Automatic Tracing Algorithmsmentioning
confidence: 99%