Viewport-Aware Omnidirectional Video Streaming Using Visual Attention and Dynamic Tiles

Özçinar, Çağrı; Cabrera, Julián; Smolić, Aljoša

doi:10.1109/euvip.2018.8611777

Cited by 5 publications

(4 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Moreover, a novel buffer updating strategy is proposed to tackle the buffering problem of 360-degree video streaming. Ozcinar et al [12] introduced a new adaptive 360-degree video streaming system, using visual attention maps. The developed system was aimed at an enhanced quality of ODV streaming viewed in HMDs.…”

Section: Estimation-based Methodsmentioning

confidence: 99%

See 1 more Smart Citation

Optimized viewport‐adaptive 360‐degree video streaming

Chen

Ahmad

2021

CAAI Trans on Intel Tech

View full text Add to dashboard Cite

Because of the rapid growth of head-mounted displays and 5G networking deployment, 360-degree video has become increasingly popular. To generate the real experience of a virtual environment, 360-degree videos require an ultrahigh resolution and frame rate to cover an omnidirectional view. These two prerequisites impose challenges for the transmission bandwidth and storage capacity of 360-degree video streaming. To reduce bandwidth and storage waste while providing a good immersive experience, we propose an optimized viewport-adaptive 360-degree video streaming method using high-efficiency video coding tiling, motion-constrained tile sets and MPEG dynamic adaptive streaming over HTTP spatial relationship description. The paper describes the rigorous design of the optimized system, which can assign different bitrates to different tiles in the viewport. The experimental results show that the proposed streaming system compares favourably with existing methods in terms of bitrate savings and storage capacity reduction. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.

show abstract

Section: Estimation-based Methodsmentioning

confidence: 99%

“…Ozcinar et al. [12] introduced a new adaptive 360‐degree video streaming system, using visual attention maps. The developed system was aimed at an enhanced quality of ODV streaming viewed in HMDs.…”

Section: Background and Related Workmentioning

confidence: 99%

Optimized viewport‐adaptive 360‐degree video streaming

Chen

Ahmad

2021

CAAI Trans on Intel Tech

View full text Add to dashboard Cite

show abstract

“…Different approaches proposed the use of adaptive tile sizes [12][13][14][15]. The work in [12] defines the tile sizes based on the users preference, but the omnidirectional visual content is not taken into account neither the bit rate required for each individual tile, which does not allow to obtain optimised efficiency.…”

Section: Introductionmentioning

confidence: 99%

“…The work in [12] defines the tile sizes based on the users preference, but the omnidirectional visual content is not taken into account neither the bit rate required for each individual tile, which does not allow to obtain optimised efficiency. Other methods, such as those presented in [14] and [15] divide the 360°ERP images into three tiles, two for the poles and one for the equator region. Then the latter is recursively divided both horizontally and vertically into smaller tiles whose size is determined based on a quality metric.…”

Section: Introductionmentioning

confidence: 99%

Attention-driven tile splitting method for improved efficiency of omnidirectional versatile video coding

Carreira

Faria

Tavora

et al. 2021

2021 IEEE International Conference on Image Processing (ICIP)

View full text Add to dashboard Cite

A common approach used in omnidirectional video coding is based on frame splitting into tiles, allowing partial delivery of only the subset of tiles that is necessary to render the user's current viewing region, defined as a specific viewport or Field-of-View (FoV). Since tiles can be independently encoded, such mechanism provides a flexible solution for encoding planar representations with ultra-high definition (UHD), such as the Equirectangular Projection (ERP), using Versatile Video Coding (VVC). By only selecting and transmitting the coded data that is required to render the necessary FoV, rather than the full 360°, a great deal of bandwidth can be saved. While current solutions are based on splitting the omnidirectional video frames into tiles of equal size, this paper proposes a new approach based on adaptive tile size, driven by visual attention. Those regions where the visual attention is higher are partitioned in smaller tiles to obtain higher bit rate granularity, allowing to decode the most frequent FoVs with minimum out-of-FoV pixels and reduced bandwidth. Optimal tile boundaries are found by solving a lagrangian minimisation problem with a cost function that achieves the best tradeoff between the standard deviation and the average attention-weighted bit rate per tile. The experimental results show that an average of 7.17% and 17.73% of bit rate savings is obtained in comparison with conventional tilling methods for the commonly used FoVs of 90°×90°and 45°×45°, respectively.

show abstract