2021
DOI: 10.48550/arxiv.2108.00480
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Realised Volatility Forecasting: Machine Learning via Financial Word Embedding

Eghbal Rahimikia,
Stefan Zohren,
Ser-Huang Poon

Abstract: We develop FinText, a novel, state-of-the-art, financial word embedding from Dow Jones Newswires Text News Feed Database. Incorporating this word embedding in a machine learning model produces a substantial increase in volatility forecasting performance on days with volatility jumps for 23 NASDAQ stocks from 27 July 2007 to 18 November 2016. A simple ensemble model, combining our word embedding and another machine learning model that uses limit order book data, provides the best forecasting performance for bot… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 40 publications
0
1
0
Order By: Relevance
“…This has given rise to an increasing usage of volatility conditional portfolios (Harvey et al 2018), with different studies reporting an overall gain in their Sharpe ratio (Moreira and Muir 2017), as well as a reduction of the likelihood of observing extreme heavy-tailed returns in volatility scaled portfolios (Harvey et al 2018). The development of volatility forecasting models has consequently attracted broad research efforts, but most of the models used by practitioners are based on classic methodologies such as the GARCH model (Bollerslev poller 2020), Graph Neural Networks (GNN) (Chen and Robert 2021), Transformer models (Ramos-Pérez, Alonso-González, and Núñez-Velázquez 2021), and NLP-based word embedding techniques (Rahimikia and Poon 2020;Rahimikia, Zohren, and Poon 2021). Furthermore, models combining traditional volatility forecasting methods with deep-learning techniques can be found in the literature (Kim and Won 2018;Mademlis and Dritsakis 2021), as well as other approaches using DNN as calibration methods for implying volatility surfaces (Horvath, Muguruza, and Tomas 2019), proving how neural network-based approaches work as complex pricing function approximators.…”
Section: Introductionmentioning
confidence: 99%
“…This has given rise to an increasing usage of volatility conditional portfolios (Harvey et al 2018), with different studies reporting an overall gain in their Sharpe ratio (Moreira and Muir 2017), as well as a reduction of the likelihood of observing extreme heavy-tailed returns in volatility scaled portfolios (Harvey et al 2018). The development of volatility forecasting models has consequently attracted broad research efforts, but most of the models used by practitioners are based on classic methodologies such as the GARCH model (Bollerslev poller 2020), Graph Neural Networks (GNN) (Chen and Robert 2021), Transformer models (Ramos-Pérez, Alonso-González, and Núñez-Velázquez 2021), and NLP-based word embedding techniques (Rahimikia and Poon 2020;Rahimikia, Zohren, and Poon 2021). Furthermore, models combining traditional volatility forecasting methods with deep-learning techniques can be found in the literature (Kim and Won 2018;Mademlis and Dritsakis 2021), as well as other approaches using DNN as calibration methods for implying volatility surfaces (Horvath, Muguruza, and Tomas 2019), proving how neural network-based approaches work as complex pricing function approximators.…”
Section: Introductionmentioning
confidence: 99%