2020 IEEE International Symposium on Circuits and Systems (ISCAS) 2020
DOI: 10.1109/iscas45731.2020.9180528
|View full text |Cite
|
Sign up to set email alerts
|

LSTM-Based Viewpoint Prediction for Multi-Quality Tiled Video Coding in Virtual Reality Streaming

Abstract: Virtual reality (VR) streaming is impaired by the large amount of data required to deliver 360-degree video resulting in low-quality end user experience when network bandwidth is limited, or latency is high. To address these challenges, proposed in this paper is a novel method for viewpoint prediction for long-term horizons in VR streaming. This method uses a long short-term memory (LSTM) encoder-decoder network to carry out a sequence-to-sequence prediction. To enhance the results obtained by this network, ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 13 publications
0
7
0
Order By: Relevance
“…Trajectory-based approaches [12]- [16], [22]- [24], [40]- [42] predict future viewing direction from one user's (single-user) or other users' (cross-user) historical head movement trajectories. [12], [40]- [42] proposed to use historical head movement data to predict future FoV.…”
Section: B Trajectory-based Prediction Methodsmentioning
confidence: 99%
“…Trajectory-based approaches [12]- [16], [22]- [24], [40]- [42] predict future viewing direction from one user's (single-user) or other users' (cross-user) historical head movement trajectories. [12], [40]- [42] proposed to use historical head movement data to predict future FoV.…”
Section: B Trajectory-based Prediction Methodsmentioning
confidence: 99%
“…1) Content-Agnostic Approaches: Several existing contentagnostic approaches predict future viewing position using various prediction methods such as, average [17], linear regression (LR) [17], [178], Dead Reckoning (DR) [179], clustering [177], [180], [181], straightforward machine learning (ML) [182]- [184], and encoder-decoder architecture [183], [185]. Qian et al [17] used average, linear regression, and weighted linear regression models for viewport prediction and then entirely streamed those tiles that will overlap with the estimated viewport.…”
Section: A Viewport Predictionmentioning
confidence: 99%
“…Jiang et al [3] applied a model based on long short-term memory (LSTM) to predict future head rotations. Jamali et al [4] used a LSTM encoder-decoder network to perform sequence-to-sequence prediction. Nasrabadi et al [5] proposed a clustering-based method to estimate the user's future viewport.…”
Section: Viewport Predictionmentioning
confidence: 99%
“…e most advanced VR streaming research mainly focuses on viewport prediction [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16]. e existing solutions [1,[11][12][13] suggest prefetching all the tiles of each segment, and higher quality of prefetching predicts the tiles in the viewport.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation