396 IEEE TRANSACTIONS ON BROADCASTING, VOL. 60, NO. 2, JUNE 2014
Fig. 3. Example of a BG reference frame generated by GMM. (a) Original texture frame from the Outdoor sequence. (b) BG reference frame obtained using GMM.
of the proposed approach is outlined in Fig. 2. This approach is based on the observation that most occluded regions belong to the background covered by foreground objects, and these occluded regions may become visible in other frames due to the foreground movement. Thus, in the proposed approach, we generate a temporally stable background image in an offline mode, and this image is then used to fill the disocclusion regions in the DIBR system. As shown in Fig. 2(a), an offline preprocessing step generates a background image from several consecutive texture frames. In this stage, a stable background image can be generated with the Gaussian Mixture Model (GMM), where the regions covered by moving foreground objects are replaced by temporally "stable" pixels. In most cases, these temporally stable pixels belong to the background, especially for regions covered by foreground objects with translational motion. An example is shown in Fig. 3, where the background information covered by the moving car can be recovered by the GMM method. However, in some cases this process may blur the moving regions, especially for foreground objects with reciprocal motion, such as the one seen in Fig. 4(a). In this scene the dancer is rotating, and consequently most of the foreground information is mistakenly modeled as background by the GMM. Therefore, the movement of foreground objects is detected in the Foreground Depth Correlation (FDC) stage to help recover the background information. By combining GMM and FDC, a background image can be obtained.
This background information can then be used during disocclusion filling in the DIBR system. Obviously, GMM and FDC can only help recover the background regions occluded by the moving objects. Therefore, in the proposed disocclusion filling approach, the disocclusions along static foreground objects are not filled using the background information, but with the conventional inpainting method [22]. Moreover, small disocclusions or holes caused by depth-value discontinuities are also filled with the inpainting method. The details of each step are described in the following sections.
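The filling policy described above can be sketched as follows. This is a minimal single-channel illustration, not the authors' implementation: `inpaint_value` is a hypothetical stand-in for the conventional inpainting method [22], and the hole and static-foreground masks are assumed to be given by the earlier warping and FDC stages.

```python
def fill_disocclusions(warped, hole_mask, static_fg_mask, bg_image, inpaint_value):
    """Fill disoccluded pixels of a DIBR-warped view.

    Holes exposed by moving objects are copied from the offline GMM/FDC
    background image; holes along static foreground objects fall back to
    the `inpaint_value` callable (a stand-in for conventional inpainting).
    All images are lists of lists of scalar pixel values.
    """
    h, w = len(warped), len(warped[0])
    out = [row[:] for row in warped]
    for y in range(h):
        for x in range(w):
            if not hole_mask[y][x]:
                continue  # visible pixel, keep the warped value
            if static_fg_mask[y][x]:
                # Hole along a static foreground object: inpainting [22].
                out[y][x] = inpaint_value(out, y, x)
            else:
                # Hole exposed by a moving object: copy from the background.
                out[y][x] = bg_image[y][x]
    return out
```

The per-pixel branch mirrors the two fill sources of the proposed approach; in practice the masks would come from the warped depth map and the foreground detection stage.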
III. BACKGROUND GENERATION
A. Background Generation With GMM
The Gaussian Mixture Model is a commonly used method for detecting moving objects [23], and it has been widely applied in the computer vision field to model the stable background. Different from methods based on block matching, the GMM operates at the pixel level, where each pixel is modeled independently by a mixture of K Gaussian distributions (a common setting is K = 3) [24], [25]. The Gaussian mixture distribution with K components can be written as:
$$p(x_t) = \sum_{i=1}^{K} \omega_{i,t} \cdot \eta\!\left(x_t,\, \mu_{i,t},\, \sigma^2_{i,t}\right) \tag{1}$$
where $p(x_t)$ indicates the probability density of pixel $x_t$, $\eta$ is the Gaussian function with $x_t$ representing the pixel value at time $t$, $\mu_{i,t}$ and $\sigma^2_{i,t}$ denote the mean and variance of the $i$th Gaussian at time $t$, respectively, and $\omega_{i,t}$ is the $i$th Gaussian distribution's weight, with $\sum_{i=1}^{K} \omega_{i,t} = 1$.
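Eq. (1) can be evaluated directly once the component parameters are known. The sketch below is illustrative only; `mixture_density` is a hypothetical helper name, not part of the described method.

```python
import math

def mixture_density(x_t, weights, means, variances):
    """Evaluate Eq. (1): p(x_t) = sum_i w_i * N(x_t; mu_i, sigma_i^2),
    where N is the univariate Gaussian density."""
    p = 0.0
    for w, mu, var in zip(weights, means, variances):
        p += w * math.exp(-(x_t - mu) ** 2 / (2.0 * var)) \
             / math.sqrt(2.0 * math.pi * var)
    return p
```

With a single component (K = 1), this reduces to the plain Gaussian density, which is a quick sanity check for the weights summing to one.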
The detailed process by which GMM generates the stable reference background is as follows [26]:
1) Firstly, the set of models is initialized at the time instant $t_0$.
• The mean value $\mu_{i,t_0}$ of the first Gaussian model is set equal to the pixel value of the current frame, and that of the other models is set to 0.
• The variance value $\sigma_{i,t_0}$ of all the $K$ Gaussian models is set to a pre-defined large value, e.g., 30 in this paper.
• The weight value of the first Gaussian model $\omega_{1,t_0}$ is set to 1, and that of the other models is set to 0.
2) For the next frame, at the time instant $t_1$, the current pixel is matched against the $K$ Gaussian models. Then, for each model $i$, the condition $|x_t - \mu_{i,t-1}| \le 2.5\,\sigma_{i,t-1}$ is examined.
• If the condition is satisfied, the matching process is stopped and the parameters of the Gaussian models are updated using the following rule:
– The mean value of the matched Gaussian model, i.e., the $i$th model, becomes $\mu_{i,t} = (1-\rho)\,\mu_{i,t-1} + \rho\, x_t$, where $\rho = \alpha \cdot \eta(x_t, \mu_{i,t}, \sigma^2_{i,t})$ and $\alpha$ is the learning rate, set to 0.005 [26].
– The variance value of the matched Gaussian model becomes $\sigma^2_{i,t} = (1-\rho)\,\sigma^2_{i,t-1} + \rho\,(x_t - \mu_{i,t})^2$.
– The weight value of the matched Gaussian model becomes $\omega_{i,t} = (1-\alpha)\,\omega_{i,t-1} + \alpha$.
– The means and variances of the other Gaussian models remain unchanged, while their weight values are updated as $\omega_{i,t} = (1-\alpha)\,\omega_{i,t-1}$.
• If instead all of the Gaussian models fail to match the current pixel, a new Gaussian model is introduced with $\mu = x_t$, a high variance $\sigma^2$ (e.g., $\sigma = 30$) and a low weight value $\omega = 0.001$, evicting the Gaussian model with the smallest $\omega/\sigma$ value.
– The means and variances of the other Gaussian models remain unchanged.
– The weight values of the $K$ Gaussian models are normalized so that $\sum_{i=1}^{K} \omega_{i,t} = 1$.
3) The remaining frames are processed by repeating step 2).
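The initialization and update steps above can be sketched per pixel as follows. This is a minimal single-channel illustration under the stated parameters (K = 3, α = 0.005, initial σ = 30). The choice in `background_value` of reading the background as the highest-weight component's mean is our assumption for illustration; the selection rule is not specified in the steps above.

```python
import math

ALPHA = 0.005       # learning rate alpha [26]
INIT_SIGMA = 30.0   # pre-defined large initial standard deviation
K = 3               # number of Gaussian components

def gaussian(x, mu, var):
    """Univariate Gaussian density eta(x; mu, sigma^2)."""
    return math.exp(-(x - mu) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def init_models(x0):
    """Step 1: first model centered on the pixel value, others empty."""
    means = [x0] + [0.0] * (K - 1)
    variances = [INIT_SIGMA ** 2] * K
    weights = [1.0] + [0.0] * (K - 1)
    return means, variances, weights

def update_pixel(x, means, variances, weights):
    """Step 2: match the pixel against the K models and update them."""
    for i in range(K):
        if abs(x - means[i]) <= 2.5 * math.sqrt(variances[i]):
            # Matched: update this model, decay the other weights.
            rho = ALPHA * gaussian(x, means[i], variances[i])
            means[i] = (1 - rho) * means[i] + rho * x
            variances[i] = (1 - rho) * variances[i] + rho * (x - means[i]) ** 2
            for j in range(K):
                weights[j] = (1 - ALPHA) * weights[j] + (ALPHA if j == i else 0.0)
            return means, variances, weights
    # No match: evict the model with the smallest w/sigma, then normalize.
    worst = min(range(K), key=lambda j: weights[j] / math.sqrt(variances[j]))
    means[worst], variances[worst], weights[worst] = x, INIT_SIGMA ** 2, 0.001
    total = sum(weights)
    weights = [w / total for w in weights]
    return means, variances, weights

def background_value(means, variances, weights):
    """Assumed rule: the highest-weight component's mean is taken as the
    stable background value for this pixel."""
    return means[max(range(K), key=lambda j: weights[j])]
```

Running this independently at every pixel over the consecutive texture frames yields the stable background image; pixels briefly covered by translationally moving foreground keep their dominant (background) component.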