2022
DOI: 10.21203/rs.3.rs-1573462/v2
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep reinforcement learning identifies personalized intermittent androgen deprivation therapy for prostate cancer

Abstract: The evolution of drug resistance leads to treatment failure and tumor progression. Intermittent androgen deprivation therapy (IADT) helps responsive cancer cells compete with resistant cancer cells in intratumoral competition. However, conventional IADT is population-based and ignores the heterogeneous phenotypes of individual patients. To address this challenge, we developed a time-varied, mixed-effect, and generative Lotka-Volterra (tM-GLV) model to account for the heterogeneity of the evolution mechanism an… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(6 citation statements)
references
References 1 publication
0
6
0
Order By: Relevance
“…The approach we have presented here is not the first DRL framework to tackle adaptive therapy. The work of Lu et al [48] is focused solely on prostate cancer and used a different underlying model, but importantly trained the network on both the PSA as well as the senstitve and resistant populations; while PSA is easily measured in a blood draw there is no direct way to measure the numbers of sensitive and resistant cells in a real patient. While they were also able to produce significantly improved responses for a single patient, the generalisability of our approach, and the clear path for translation, make our implementation somewhat more robust.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…The approach we have presented here is not the first DRL framework to tackle adaptive therapy. The work of Lu et al [48] is focused solely on prostate cancer and used a different underlying model, but importantly trained the network on both the PSA as well as the senstitve and resistant populations; while PSA is easily measured in a blood draw there is no direct way to measure the numbers of sensitive and resistant cells in a real patient. While they were also able to produce significantly improved responses for a single patient, the generalisability of our approach, and the clear path for translation, make our implementation somewhat more robust.…”
Section: Discussionmentioning
confidence: 99%
“…Beyond uncertainty in the tumor's parameters, we cannot expect all tumors to behave in a way consistent with the virtual patient model (1), given the heterogeneity observed clinically in tumor dynamics and treatment response. We therefore considered the robustness of the DRL framework to variation in the underlying model and therefore tumor dynamics, by evaluating our framework on a modified Lotka-Volterra model introduced by Lu et al [48] and reproduced in Supplementary Information 5.2. This model uses exponential growth instead of logistic, such that the tumor may grow indefinitely, with a modified progression criterion based on the growth of the resistant subpopulation alone.…”
Section: Drl Framework Can Be Robust To Variation In Patient Parametersmentioning
confidence: 99%
See 1 more Smart Citation
“…Section 3.5 also explores the robustness of the DRL framework to variation in patient dynamics. Through this, we introduce a modified Lotka-Volterra model introduced by Lu et al [17], to demonstrate that a pre-trained DRL network can adapt to changes in the underlying tumor dynamics. Explicitly, this model may be written (in non-dimensional form) as:…”
Section: S6 Drl Robustness To Model Variationmentioning
confidence: 99%
“…For this model, progression was defined as growth in the resistant population alone to 0.1K R . The model was parameterized according to Table S2, replicating values used by Lu et al [17], and chosen to ensure that the profile in question does reach progression. Evaluating a pre-trained DRL framework on this new model, it attained a TTP of 1506 ± 3 days, outperforming the AT50 TTP of 1119 days (Figure S3).…”
Section: S6 Drl Robustness To Model Variationmentioning
confidence: 99%