Time Series Forecasting (TSF) is essential to key domains, and transformer neural networks have advanced the state of the art on global, multi-horizon TSF benchmarks. The quadratic time and memory complexity of the vanilla transformer (VT) hinders its application to big-data environments; consequently, multiple efficient VT variants that lower complexity via sparse self-attention have been proposed. However, lower asymptotic complexity does not directly translate into faster execution, and machine learning models for big data are typically trained on accelerators designed for dense-matrix computation, which perform poorly on sparse matrices. To better compare the accuracy-speed trade-off of the VT and its variants, it is essential to benchmark them on such accelerators. To address this task, we implemented a cloud-based VT on Tensor Processing Units. Experiments on large-scale datasets show that our transformer outperforms two reference models in accuracy while reducing training times from hours to under two minutes.
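To make the quadratic cost concrete, the following is a minimal NumPy sketch of vanilla scaled dot-product self-attention, not the paper's implementation: for a sequence of length n, the intermediate score matrix has shape (n, n), which is the source of the O(n^2) time and memory complexity mentioned above. All names and dimensions here are illustrative assumptions.

```python
import numpy as np

def scaled_dot_product_attention(q, k, v):
    """Vanilla self-attention (illustrative sketch).

    The (n, n) score matrix built below is what makes the vanilla
    transformer quadratic in sequence length n, in both time and memory.
    """
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)  # shape (n, n): the O(n^2) bottleneck
    # Numerically stable row-wise softmax over the scores
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v  # shape (n, d)

# Hypothetical sizes: sequence length n and model dimension d
n, d = 512, 64
x = np.random.default_rng(0).normal(size=(n, d))
out = scaled_dot_product_attention(x, x, x)
print(out.shape)  # (512, 64); the intermediate score matrix was (512, 512)
```

Sparse-attention variants avoid materializing the full (n, n) matrix, but on accelerators optimized for dense matrix multiplication that sparsity does not automatically yield a speedup, which motivates the dense TPU benchmark above.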