2001
DOI: 10.1109/72.935097
|View full text |Cite
|
Sign up to set email alerts
|

Learning to trade via direct reinforcement

Abstract: We present methods for optimizing portfolios, asset allocations, and trading systems based on direct reinforcement (DR). In this approach, investment decision-making is viewed as a stochastic control problem, and strategies are discovered directly. We present an adaptive algorithm called recurrent reinforcement learning (RRL) for discovering investment policies. The need to build forecasting models is eliminated, and better trading performance is obtained. The direct reinforcement approach differs from dynamic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
294
0
1

Year Published

2007
2007
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 401 publications
(295 citation statements)
references
References 45 publications
0
294
0
1
Order By: Relevance
“…In [15], an adaptive algorithm, which was named recurrent reinforcement learning, for direct reinforcement was proposed, and it was used to learn an investment strategy online. Later, Moody and Saffell [16] have shown how to train trading systems via direct reinforcement. Performance of the learning algorithm proposed in [16] was demonstrated through the intraday currency trader and monthly asset allocation system for S&P 500 stock index and T-Bills.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…In [15], an adaptive algorithm, which was named recurrent reinforcement learning, for direct reinforcement was proposed, and it was used to learn an investment strategy online. Later, Moody and Saffell [16] have shown how to train trading systems via direct reinforcement. Performance of the learning algorithm proposed in [16] was demonstrated through the intraday currency trader and monthly asset allocation system for S&P 500 stock index and T-Bills.…”
mentioning
confidence: 99%
“…Later, Moody and Saffell [16] have shown how to train trading systems via direct reinforcement. Performance of the learning algorithm proposed in [16] was demonstrated through the intraday currency trader and monthly asset allocation system for S&P 500 stock index and T-Bills.…”
mentioning
confidence: 99%
“…Using some measure of risk adjusted return (as in [8]) might be of interest in the context of simulated electricity trade and this would simply involve the definition of a new task and would not require any modification of the environment.…”
Section: Agent Tasksmentioning
confidence: 99%
“…In [8] and [9] a recurrent gradient method was used to optimise financial investment performance without price forecasting and in [38] a modified version of REINFORCE is used to simulate a marketplace for grid computing resources.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation