2019
DOI: 10.1007/978-3-030-38364-0_28
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning in Stock Trading

Abstract: Using machine learning techniques in financial markets, particularly in stock trading, attracts a lot of attention from both academia and practitioners in recent years. Researchers have studied different supervised and unsupervised learning techniques to either predict stock price movement or make decisions in the market. In this paper we study the usage of reinforcement learning techniques in stock trading. We evaluate the approach on real-world stock dataset. We compare the deep reinforcement learning approa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
14
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 34 publications
(14 citation statements)
references
References 35 publications
0
14
0
Order By: Relevance
“…In quantitative finance, stock trading is essentially making dynamic decisions, namely to decide where to trade, at what price, and what quantity, over a highly stochastic and complex stock market. As a result, DRL provides useful toolkits for stock trading [21,44,48,45,10,8,26]. Taking many complex financial factors into account, DRL trading agents build a multi-factor model and provide algorithmic trading strategies, which are difficult for human traders [3,47,24,22].…”
Section: Introductionmentioning
confidence: 99%
“…In quantitative finance, stock trading is essentially making dynamic decisions, namely to decide where to trade, at what price, and what quantity, over a highly stochastic and complex stock market. As a result, DRL provides useful toolkits for stock trading [21,44,48,45,10,8,26]. Taking many complex financial factors into account, DRL trading agents build a multi-factor model and provide algorithmic trading strategies, which are difficult for human traders [3,47,24,22].…”
Section: Introductionmentioning
confidence: 99%
“…They applied the idea of a rolling window, where the best algorithm is picked to trade in the following period. Recently, many researchers provide more DRL solutions for STP tasks [3,8,18]. However, most existing works are based on several assumptions, which limits the practicality.…”
Section: Drl In Financementioning
confidence: 99%
“…005 for three futures and 0. 001 for two stocks; the number of filters is 32; the parameters (S,M,L) for HC and I are set to (3,12,22) and set to (1,10,20) for J, NFLX and AAPL. For the INTRA module of the AuxOut, the learning rate is 0.…”
Section: Experimental Settingsmentioning
confidence: 99%
“…It turns out that one-layer and 100 hidden units are the best. As for AuxIn and the AuxOut, we tune the number of filters being {8, 16, 32, 64}, the parameter being (S,M, L) { (1,10,20), (3,12,22 ), (5,15, 25)} of the INTER module and the size of hidden states being {16, 32, 64} of the INTRA module.…”
Section: Experimental Settingsmentioning
confidence: 99%
See 1 more Smart Citation