IEEE Transactions on Multimedia

Optimizing Fixation Prediction Using Recurrent Neural Networks for 360° Video Streaming in Head-Mounted Virtual Reality



Abstract

We study the problem of predicting the viewing probability of different parts of 360° videos when streaming them to head-mounted displays. We propose a fixation prediction network based on recurrent neural networks, which leverages sensor and content features. The content features are derived by computer vision (CV) algorithms, which may suffer from inferior performance due to various types of distortion caused by diverse 360° video projection models. We propose a unified approach with overlapping virtual viewports to eliminate such negative effects, and we evaluate our proposed solution using several CV algorithms, such as saliency detection, face detection, and object detection. We find that overlapping virtual viewports increase the performance of these existing CV algorithms that were not trained for 360° videos. We next fine-tune our fixation prediction network with diverse design options, including: 1) with or without overlapping virtual viewports, 2) with or without future content features, and 3) different feature sampling rates. We empirically choose the best fixation prediction network and use it in a 360° video streaming system. We conduct extensive trace-driven simulations with a large-scale dataset to quantify the performance of the 360° video streaming system with different fixation prediction algorithms. The results show that our proposed fixation prediction network outperforms other algorithms in several aspects, such as: 1) achieving comparable video quality (average gaps between -0.05 and 0.92 dB), 2) consuming much less bandwidth (average bandwidth reduction of up to 8 Mb/s), 3) reducing rebuffering time (by 40 s on average in bandwidth-limited 4G cellular networks), and 4) running in real time (at most 124 ms).
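To make the described architecture concrete, below is a minimal PyTorch sketch of an RNN that fuses per-timestep sensor features (e.g., HMD orientation) with content features (e.g., tile-level saliency) and outputs per-tile viewing probabilities. All names, layer sizes, and the tiling granularity are illustrative assumptions, not the paper's actual configuration.

```python
import torch
import torch.nn as nn

class FixationPredictionNet(nn.Module):
    """Illustrative sketch: an LSTM over concatenated sensor + content
    features, followed by a linear head that scores each video tile."""
    def __init__(self, sensor_dim=4, content_dim=192, num_tiles=192, hidden=128):
        super().__init__()
        self.rnn = nn.LSTM(sensor_dim + content_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, num_tiles)

    def forward(self, sensor_feats, content_feats):
        # sensor_feats:  (batch, time, sensor_dim)   e.g., orientation quaternions
        # content_feats: (batch, time, content_dim)  e.g., per-tile saliency scores
        x = torch.cat([sensor_feats, content_feats], dim=-1)
        h, _ = self.rnn(x)
        # Sigmoid gives an independent viewing probability per tile,
        # which a tile-based streaming system can use to prioritize fetches.
        return torch.sigmoid(self.head(h))

# Example: predict tile viewing probabilities from a 1-s window sampled at 30 Hz.
net = FixationPredictionNet()
probs = net(torch.randn(1, 30, 4), torch.randn(1, 30, 192))
print(probs.shape)  # torch.Size([1, 30, 192])
```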
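The overlapping-virtual-viewport idea can likewise be sketched at a high level: render several overlapping, undistorted perspective viewports from the 360° frame, run an off-the-shelf CV algorithm on each, and remap the results. The helpers `render_viewport` and `detector` below are hypothetical placeholders; real viewport rendering requires a rectilinear (gnomonic) projection of the equirectangular frame.

```python
def viewport_centers(yaw_step=45, pitches=(-45, 0, 45)):
    """Yaw/pitch centers of overlapping virtual viewports. With a 90° FoV,
    a 45° yaw step gives roughly 50% horizontal overlap between neighbors
    (the step and pitch values here are illustrative assumptions)."""
    return [(yaw, pitch) for yaw in range(0, 360, yaw_step) for pitch in pitches]

def detect_on_viewports(frame, render_viewport, detector, fov=90):
    """Run a conventional CV detector on each undistorted viewport and tag
    each result with its viewport center, so detections can later be
    remapped onto the equirectangular frame and merged across overlaps.
    `render_viewport(frame, yaw, pitch, fov)` is an assumed helper."""
    results = []
    for yaw, pitch in viewport_centers():
        view = render_viewport(frame, yaw, pitch, fov)
        results.append(((yaw, pitch), detector(view)))
    return results
```

Running detectors on undistorted viewports rather than on the raw equirectangular frame is what lets CV models that were never trained on 360° content keep their accuracy.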
