首页> 外文学位 >Multiview/stereoscopic video analysis, compression, and virtual viewpoint synthesis.
【24h】

Multiview/stereoscopic video analysis, compression, and virtual viewpoint synthesis.

机译:多视图/立体视频分析,压缩和虚拟视点合成。

获取原文
获取原文并翻译 | 示例

摘要

Stereoscopic or in general multiview video can provide more vivid and accurate information about the scene structure than from monoview video. However one major obstacle for using multiview video is the extremely large amount of data associated with it. This dissertation considers the problem of structure and motion estimation in multiview tele-conferencing type sequences and its application for video sequence compression and for intermediate view generation. First, we describe a novel image alignment approach, which can convert images captured using non-parallel cameras to coplanar like images. This approach greatly eases the computational burden incurred by the non-parallel camera geometry, where one must consider both horizontal and vertical disparities. Next, we introduce a new approach for structure estimation from a stereo pair acquired by two parallel cameras. It is based on a 3D-mesh representation of the imaged object and a parameterization of the structure information by the disparity between corresponding nodes in the image pair. Finally we present a coder for multiview sequences, which exploits the proposed alignment and structure estimation algorithm. By extracting the foreground objects and estimating the disparity field between a selected view and a reference view, the coder can compress the image pair very efficiently. In the mean time, by using the coded structure information, the decoder can generate virtual viewpoints between decoded views, which can be very helpful for tele-presence applications.
机译:立体或一般多视图视频可以提供比单视图视频更生动,准确的场景结构信息。然而,使用多视点视频的一个主要障碍是与之关联的大量数据。本文考虑了多视图电视会议类型序列的结构和运动估计问题及其在视频序列压缩和中间视图生成中的应用。首先,我们描述一种新颖的图像对齐方法,该方法可以将使用非平行相机捕获的图像转换为共面图像。这种方法极大地减轻了非平行相机几何结构带来的计算负担,在这种情况下,人们必须同时考虑水平和垂直视差。接下来,我们介绍一种从两个并行摄像机获取的立体声对进行结构估计的新方法。它基于成像对象的3D网格表示和基于图像对中相应节点之间视差的结构信息参数化。最后,我们提出了一种用于多视图序列的编码器,该编码器利用了提出的比对和结构估计算法。通过提取前景对象并估计所选视图和参考视图之间的视差字段,编码器可以非常有效地压缩图像对。同时,通过使用编码的结构信息,解码器可以在解码的视图之间生成虚拟视点,这对于远程呈现应用非常有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号