Motion Background Modeling based on Context-encoder
Zhenshen Qu, Shimeng Yu, Mengyu Fu
Dept. of Control Science and Engineering,
Harbin Institute of Technology,
Harbin, China
yushimeng@hotmail.com
Abstract—This paper proposes a method for modeling the motion-based background of a video captured by a moving camera. We utilize the recently proposed context-encoder to model the motion-based background of a video that contains dynamic foreground objects. The method aims to restore the overall scene of a video by removing the moving foreground objects and learning the features of its context. An advantage of this method is that the performance of background modeling is not affected even when the camera moves fast.
Keywords—motion-based background modeling; context-
encoder; convolutional neural networks
I. INTRODUCTION
Background modeling is an important component of many computer vision systems and is widely used as a preprocessing step for tasks such as foreground detection [1], object segmentation [2], tracking [3] and video surveillance [4].
A great deal of research has been done and many methods have been developed in this area in recent years. These methods can be classified into the following categories [5]: Basic Background Modeling, Statistical Background Modeling, Fuzzy Background Modeling, Background Clustering, Neural Network Background Modeling, Wavelet Background Modeling and Background Estimation. Further classifications can be found in [6].
Conventional background modeling methods require a fixed camera position so that the background stays stationary, and a great deal of work on moving objects has been done with stationary cameras [7]. However, under certain conditions the camera's position changes and a model of the non-stationary background is needed. The Mixture of Gaussians (MOG) background model is highly efficient at modeling multi-modal background distributions and has been widely used. MOG can adapt to small changes in the background (for example, waving leaves and gradual illumination changes), but it cannot cope when the scene changes substantially. [8] presented a background modeling approach that is immune to variations of the background, but it fails when the camera moves fast and the background changes considerably.
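As a point of reference, the following is a minimal sketch of MOG-style background subtraction using OpenCV's MOG2 implementation; the video file name and parameter values are illustrative assumptions, not taken from the work cited above:

import cv2

# MOG2 models every pixel with an adaptive mixture of Gaussians.
cap = cv2.VideoCapture("video.mp4")   # hypothetical input video
mog = cv2.createBackgroundSubtractorMOG2(history=500, varThreshold=16,
                                         detectShadows=True)
while True:
    ok, frame = cap.read()
    if not ok:
        break
    fg_mask = mog.apply(frame)             # 255 = foreground, 127 = shadow
    background = mog.getBackgroundImage()  # current background estimate
cap.release()

Because each pixel's mixture is updated online, the model absorbs gradual changes but breaks down when camera motion shifts the whole scene at once.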
[9] introduced a Spatial Distribution of Gaussians (SDG) model that can detect foreground objects against a non-stationary background; similar and earlier studies can be found in [10]. [11] proposed a real-time optical flow algorithm to detect moving objects in a dynamic scene, and [12] is a further study of [11]. Another background modeling method for dynamic scenes, which computes optical flow in a higher-dimensional space to model dynamic characteristics, was proposed in [13]. The motion-based background modeling method presented in [14] also used optical flow to detect moving objects. However, motion field computation based on optical flow can be time-consuming.
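For illustration, a dense optical flow computation in the spirit of these methods can be sketched with OpenCV's Farneback algorithm; the frame file names and the magnitude threshold below are illustrative assumptions:

import cv2
import numpy as np

# Two consecutive grayscale frames (hypothetical file names).
prev = cv2.cvtColor(cv2.imread("frame0.png"), cv2.COLOR_BGR2GRAY)
curr = cv2.cvtColor(cv2.imread("frame1.png"), cv2.COLOR_BGR2GRAY)

# Dense Farneback flow: one 2-D motion vector per pixel.
flow = cv2.calcOpticalFlowFarneback(prev, curr, None,
                                    0.5, 3, 15, 3, 5, 1.2, 0)
mag, _ = cv2.cartToPolar(flow[..., 0], flow[..., 1])

# Pixels whose motion exceeds the threshold are flagged as moving objects.
moving = (mag > 2.0).astype(np.uint8) * 255

Computing such a flow field for every frame is what makes these methods expensive.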
In this paper, a new approach is proposed to estimate the motion-based background while the camera is moving. We use convolutional neural networks (CNNs) to achieve this goal, applying the unsupervised visual feature learning algorithm presented in [15] to our motion-based background estimation. [15] introduced the context-encoder, which predicts the missing part of an image from the surroundings of the missing region so that the prediction approximates the original scene as closely as possible. We utilize the context-encoder to restore a complete background from a video captured by a moving camera in the presence of dynamic obstacles. An obvious advantage of this method for background extraction is that its performance is not affected even when the camera moves fast.
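To make the idea concrete, the following is a minimal PyTorch sketch of a context-encoder trained with only the reconstruction (L2) loss; the layer sizes, the fixed center mask and the omission of the adversarial loss used in [15] are our simplifications, not the exact architecture of [15]:

import torch
import torch.nn as nn

class ContextEncoder(nn.Module):
    """Encoder-decoder CNN that predicts a masked image region."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(                      # 3x128x128 input
            nn.Conv2d(3, 64, 4, stride=2, padding=1),      # -> 64x64x64
            nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, 4, stride=2, padding=1),    # -> 128x32x32
            nn.ReLU(inplace=True),
            nn.Conv2d(128, 256, 4, stride=2, padding=1),   # -> 256x16x16
            nn.ReLU(inplace=True),
        )
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(256, 128, 4, stride=2, padding=1),  # -> 32x32
            nn.ReLU(inplace=True),
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1),   # -> 64x64
            nn.ReLU(inplace=True),
            nn.Conv2d(64, 3, 3, padding=1),                # 3x64x64 patch
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

model = ContextEncoder()
opt = torch.optim.Adam(model.parameters(), lr=2e-4)

frames = torch.rand(8, 3, 128, 128)     # stand-in batch of video frames
masked = frames.clone()
masked[:, :, 32:96, 32:96] = 0.0        # zero out a 64x64 "foreground" hole
target = frames[:, :, 32:96, 32:96]     # ground truth for the hole

pred = model(masked)                    # predicted content of the hole
loss = nn.functional.mse_loss(pred, target)
opt.zero_grad()
loss.backward()
opt.step()

In the setting of this paper, the masked region would correspond to the removed moving foreground rather than a fixed center crop.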
The rest of this paper is organized as follows: related work is introduced in Section 2. Details of the proposed motion-based background modeling method are described in Section 3, where we also discuss the problems encountered in our experiments and propose solutions. The experimental results are presented in Section 4. Finally, the conclusion is given in Section 5.
II. RELATED WORK
CNNs have performed well in many semantic image understanding tasks, including unsupervised understanding and natural image generation [15]. Autoencoders [16, 18], which can learn the features of an image, are typical deep unsupervised learning methods in this field. Denoising autoencoders [17] can “make the learned representations robust to partial corruption of the input pattern”. The context-encoder [15] could be thought of as a variant of
*Research supported by the Chinese National Natural Science Foundation (61375046, Scene flow computation based on dynamic primitive feature).