2022
DOI: 10.1007/s10489-022-04217-5
|View full text |Cite
|
Sign up to set email alerts
|

From deterministic to stochastic: an interpretable stochastic model-free reinforcement learning framework for portfolio optimization

Abstract: As a fundamental problem in algorithmic trading, portfolio optimization aims to maximize the cumulative return by continuously investing in various financial derivatives within a given time period. Recent years have witnessed the transformation from traditional machine learning trading algorithms to reinforcement learning algorithms due to their superior nature of sequential decision making. However, the exponential growth of the imperfect and noisy financial data that is supposedly leveraged by the determinis… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 37 publications
0
4
0
Order By: Relevance
“…F g,2 is then distributed to each client for local multiscale feature fusion. In such a way, the aggregated features F g,n at the n-th scale are obtained, and F g,n is sent down to the client for local multiscale feature fusion to obtain the fused features F f ,i,n , as shown in Equation (5).…”
Section: Multiscale Recursive Attention Gate Federation Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…F g,2 is then distributed to each client for local multiscale feature fusion. In such a way, the aggregated features F g,n at the n-th scale are obtained, and F g,n is sent down to the client for local multiscale feature fusion to obtain the fused features F f ,i,n , as shown in Equation (5).…”
Section: Multiscale Recursive Attention Gate Federation Methodsmentioning
confidence: 99%
“…When the data quality is better, the spectrum analysis can find the characteristic frequency of the fault and realize fault diagnosis. However, when the data quality is poor, the feature frequency of the fault is difficult to present on the spectrum [ 4 , 5 ].…”
Section: Introductionmentioning
confidence: 99%
“… Song et al (2022) introduced an innovative method for optimizing investment portfolios utilizing stochastic reinforcement learning. To validate their model, the authors conducted an empirical analysis using data from 22 stocks with the highest trading volume in the S&P 500 index from 2005 to 2020.…”
Section: Theoretical Aspectsmentioning
confidence: 99%
“…The financial markets display a high degree of dynamism and complexity, making the selection of an optimal combination of assets for constructing an investment portfolio a formidable challenge ( Song et al, 2022 ; Xiao & Ihnaini, 2023 among others). In this context, scholars have conducted thorough investigations into Modern Portfolio Theory (MPT) of Markowitz (1952) since its inception.…”
Section: Introductionmentioning
confidence: 99%