2021
DOI: 10.1080/14697688.2021.2001032
|View full text |Cite
|
Sign up to set email alerts
|

What is the value of the cross-sectional approach to deep reinforcement learning?

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 26 publications
0
10
0
Order By: Relevance
“…Examples of reward signals include portfolio return (Jiang et al., 2017; Pendharkar & Cusatis, 2018; Yu et al., 2019), (differential) Sharpe ratio (Du et al., 2016; Pendharkar … Cusatis, 2018), and profit (Du et al., 2016). The benchmark strategies include Constantly Rebalanced Portfolio (CRP) (Yu et al., 2019; Jiang et al., 2017) where at each period the portfolio is rebalanced to the initial wealth distribution among the assets, and the buy‐and‐hold or do‐nothing strategy (Park et al., 2020; Aboussalah, 2020), which does not take any action but rather holds the initial portfolio until the end. The performance measures studied in these papers include the Sharpe ratio (Yu et al., 2019; Wang … Zhou, 2020; Xiong et al., 2018; Jiang et al., 2017; Liang et al., 2018; Park et al., 2020; Wang, 2019), the Sortino ratio (Yu et al., 2019), portfolio returns (Aboussalah, 2020; Liang et al., 2018; Park et al., 2020; Wang, 2019; Xiong et al., 2018; Yu et al., 2019), portfolio values (Jiang et al., 2017; Pendharkar & Cusatis, 2018; Xiong et al., 2018), and cumulative profits (Du et al., 2016).…”
Section: Applications In Financementioning
confidence: 99%
See 2 more Smart Citations
“…Examples of reward signals include portfolio return (Jiang et al., 2017; Pendharkar & Cusatis, 2018; Yu et al., 2019), (differential) Sharpe ratio (Du et al., 2016; Pendharkar … Cusatis, 2018), and profit (Du et al., 2016). The benchmark strategies include Constantly Rebalanced Portfolio (CRP) (Yu et al., 2019; Jiang et al., 2017) where at each period the portfolio is rebalanced to the initial wealth distribution among the assets, and the buy‐and‐hold or do‐nothing strategy (Park et al., 2020; Aboussalah, 2020), which does not take any action but rather holds the initial portfolio until the end. The performance measures studied in these papers include the Sharpe ratio (Yu et al., 2019; Wang … Zhou, 2020; Xiong et al., 2018; Jiang et al., 2017; Liang et al., 2018; Park et al., 2020; Wang, 2019), the Sortino ratio (Yu et al., 2019), portfolio returns (Aboussalah, 2020; Liang et al., 2018; Park et al., 2020; Wang, 2019; Xiong et al., 2018; Yu et al., 2019), portfolio values (Jiang et al., 2017; Pendharkar & Cusatis, 2018; Xiong et al., 2018), and cumulative profits (Du et al., 2016).…”
Section: Applications In Financementioning
confidence: 99%
“…(2021) embedded alpha portfolio strategies into a deep policy‐based method and designed a framework, which is easier to interpret. Using Actor–Critic methods, Aboussalah (2020) combined the mean–variance framework (the actor determines the policy using the mean–variance framework) and the Kelly Criterion framework (the critic evaluates the policy using their growth rate). They studied eight policy‐based algorithms including DPG, DDPG, and PPO, among which DPG was shown to achieve the best performance.…”
Section: Applications In Financementioning
confidence: 99%
See 1 more Smart Citation
“…Brandt et al (2009) also assume parametric portfolio policies that exploit the characteristics of the cross-section of returns in an optimal asset allocation context. Aboussalah et al (2021) apply mean-variance (and growth optimal investing) strategies in a cross-sectional setting.…”
Section: Literature Reviewmentioning
confidence: 99%
“…In our work, we directly optimized the cross-holdings between organizations from a global regulator's perspective in order to mitigate systemic risk of the global network in the face of exogenous shocks. The reinforcement learning agents in 14 and 15 would be good candidates for extending 13 work to use multi-agent learning to better understand and mitigate systemic risk.…”
mentioning
confidence: 99%