1998
DOI: 10.1002/(sici)1099-131x(1998090)17:5/6<441::aid-for707>3.3.co;2-r
|View full text |Cite
|
Sign up to set email alerts
|

Performance functions and reinforcement learning for trading systems and portfolios

Abstract: We propose to train trading systems and portfolios by optimizing objective functions that directly measure trading and investment performance. Rather than basing a trading system on forecasts or training via a supervised learning algorithm using labelled trading data, we train our systems using recurrent reinforcement learning (RRL) algorithms. The performance functions that we consider for reinforcement learning are profit or wealth, economic utility, the Sharpe ratio and our proposed differential Sharpe rati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
14
0
1

Year Published

1998
1998
2012
2012

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 41 publications
(16 citation statements)
references
References 0 publications
1
14
0
1
Order By: Relevance
“…While the reinforcement learning is promising, introduction of these considerations will make the problem more complex. Therefore, one of the future AND 15 research problems will be to make the reinforcement learning formulation with these considerations tractable.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…While the reinforcement learning is promising, introduction of these considerations will make the problem more complex. Therefore, one of the future AND 15 research problems will be to make the reinforcement learning formulation with these considerations tractable.…”
Section: Discussionmentioning
confidence: 99%
“…Another portfolio management system built by use of Q-learning was presented in [14] where absolute profit and relative risk-adjusted profit were considered as performance functions to train a system. In [15], an adaptive algorithm, which was named recurrent reinforcement learning, for direct reinforcement was proposed, and it was used to learn an investment strategy online. Later, Moody and Saffell [16] have shown how to train trading systems via direct reinforcement.…”
mentioning
confidence: 99%
“…For some process-dependent investors (Moody et al 1998), it is important to evaluate risk and risk-adjusted return of portfolios (Sharpe 1963(Sharpe , 1994. One common way to achieve this is to use annualized standard deviation of daily returns to measure the volatility risk and annualized Sharpe Ratio (SR) to evaluate the risk-adjusted return.…”
Section: Performance Criteriamentioning
confidence: 99%
“…In [8] and [9] a recurrent gradient method was used to optimise financial investment performance without price forecasting and in [38] a modified version of REINFORCE is used to simulate a marketplace for grid computing resources.…”
Section: Related Workmentioning
confidence: 99%
“…They can be used with function approximation techniques without suffering from the problems that mar value function based methods. They have been successfully applied in several types of operational setting, including robotic control [7], financial trading [8], [9] and network routing [10], but they have not been previously applied in simulations of competitive electricity trade.…”
mentioning
confidence: 99%