Novel View Synthesis using Feature-preserving Depth Map Resampling
Duo Chen, Jie Feng and Bingfeng Zhou
Institute of Computer Science and Technology, Peking University, Beijing, China
{chenduo, feng_jie, cczbf}@pku.edu.cn
Keywords:
Novel View Synthesis, Depth Map, Importance Sampling, Image Projection.
Abstract:
In this paper, we present a new method for synthesizing images of a 3D scene at novel viewpoints, based on a
set of reference images taken in a casual manner. With such an image set as input, our method first reconstructs
a sparse 3D point cloud of the scene, which is then projected onto each reference image to obtain a set of depth
points. Afterwards, an improved error-diffusion sampling method is utilized to generate a sampling point set
in each reference image that includes the depth points and preserves the image features well, so that the
image can be triangulated on the basis of this point set. Then, we propose a distance metric based on
Euclidean distance, color similarity and boundary distribution to propagate depth information from the depth
points to the rest of the sampling points, and hence a dense depth map can be generated by interpolation in the
triangle mesh. Given a desired viewpoint, several of the closest reference viewpoints are selected, and their
colored depth maps are projected to the novel view. Finally, the multiple projected images are merged to fill
the holes caused by occlusion, resulting in a complete novel view. Experimental results demonstrate that our
method can achieve high-quality results for outdoor scenes containing challenging objects.
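To make the depth propagation step concrete, the following is a minimal Python sketch of the kind of combined metric the abstract describes. The weights w_spatial, w_color and w_boundary, and the binary boundary test, are our own illustrative assumptions, not the paper's exact formulation:

    import numpy as np

    def propagation_distance(p, q, color_p, color_q, crosses_boundary,
                             w_spatial=1.0, w_color=0.5, w_boundary=10.0):
        # Euclidean term: image-plane distance between the two sampling points.
        d_spatial = np.linalg.norm(np.asarray(p, float) - np.asarray(q, float))
        # Color term: points with similar colors are more likely to share depth.
        d_color = np.linalg.norm(np.asarray(color_p, float) - np.asarray(color_q, float))
        # Boundary term: heavily penalize pairs separated by an image boundary,
        # so depth does not leak across object silhouettes (hypothetical weights).
        d_boundary = 1.0 if crosses_boundary else 0.0
        return w_spatial * d_spatial + w_color * d_color + w_boundary * d_boundary

    def propagate_depth(samples, colors, depth_pts, depth_colors, depths, crosses):
        # Assign each sampling point the depth of its nearest depth point under
        # the combined metric; crosses(s, d) is a user-supplied boundary test.
        result = []
        for s, c in zip(samples, colors):
            dists = [propagation_distance(s, d, c, dc, crosses(s, d))
                     for d, dc in zip(depth_pts, depth_colors)]
            result.append(depths[int(np.argmin(dists))])
        return result

A dense depth map would then follow by interpolating the propagated depths inside each triangle of the mesh.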
1 INTRODUCTION
Given a set of reference images of a scene, novel view
synthesis (NVS) methods aim to render the scene at
novel viewpoints. NVS is an important task in com-
puter vision and graphics, and is useful in areas such
as stereo display and virtual reality. Its applications
include 3DTV, Google Street View (Anguelov et al.,
2010), scene roaming and teleconferencing.
NVS methods can be divided into two categories:
small-baseline methods and large-baseline methods,
where “baseline” refers to the translation and rotation
between adjacent viewpoints.
In the case of small-baseline problems, some
methods focus on parameterizing the plenoptic func-
tion with a high sampling density. They arrange the
camera positions in carefully designed configurations
and sample the scene uniformly with reference images. Typ-
ical examples include the light field (Levoy et al., 1996)
and the unstructured lumigraph (Buehler et al., 2001).
Other methods (Mahajan et al., 2009; Evers-
Senne and Koch, 2003) produce novel views by
interpolating video frames, where adjacent frames
have close viewpoints. Methods based on optical
flow also belong to the small-baseline category.
On the other hand, large-baseline NVS is a chal-
lenging, under-constrained problem due to the lack of
full 3D knowledge, scale changes and complex oc-
clusions. It is thus necessary to seek additional depth
and geometry information, or constraints such as
photo-consistency and color-consistency.
For example, Google Street View (Anguelov et al.,
2010) directly acquires depth information with laser
scanners to interpolate large-baseline images. Other
methods utilize structure-from-motion (SFM)
and multi-view stereo (MVS) to recover a sparse 3D
point cloud of the scene and synthesize novel views
based on it. For instance, the rendering algo-
rithm of Chaurasia et al. (Chaurasia et al., 2013) syn-
thesizes depth for the poorly reconstructed regions of
MVS and provides plausible image-based naviga-
tion. However, their approach is limited by the ca-
pabilities of the oversegmentation, so very thin
structures may be missing in the novel view.
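As background for the point-cloud-based pipelines above (and for the projection step in our own method), projecting a sparse SFM point cloud into a reference view to obtain depth points is the standard pinhole-camera operation. A minimal numpy sketch, with naming conventions of our own rather than from the cited works:

    import numpy as np

    def project_point_cloud(X, K, R, t):
        # X: (N, 3) world-space points; K: (3, 3) intrinsics;
        # R, t: world-to-camera rotation and translation.
        Xc = X @ R.T + t                      # world -> camera frame
        z = Xc[:, 2]                          # depth along the optical axis
        uv = (Xc @ K.T)[:, :2] / z[:, None]   # intrinsics + perspective division
        return uv, z                          # pixel coordinates and depths

Points with non-positive depth, or whose pixels fall outside the image bounds, are discarded; the surviving (uv, z) pairs form the per-view depth points.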
Recent works also address the problem of large-
baseline NVS by training neural networks in an end-
to-end manner (Flynn et al., 2016). These methods
only require sets of posed images as training data,
and they generalize well, giving good results on
test sets that differ considerably from the train-
ing set. However, these methods are usually slower
than MVS-based methods, and detailed textures in the
images are usually blurred. Moreover, the relationship
between 3D objects and their 2D projections has a
clear formulation, and requiring neural networks to
learn this relationship from scratch seems redundant.
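For reference, that formulation is the standard pinhole projection: in homogeneous coordinates, a world point $\mathbf{X}$ maps to its image point $\mathbf{x}$ as
\[ \mathbf{x} \simeq K \, [R \mid \mathbf{t}] \, \mathbf{X}, \]
where $K$ is the 3x3 intrinsic matrix, $[R \mid \mathbf{t}]$ the camera pose, and $\simeq$ denotes equality up to scale.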