高斯混合模型在前景检测中的背景建模综述

背景建模

需积分: 9 14 浏览量更新于2024-07-25 收藏 408KB PDF 举报

身份认证购VIP最低享 7 折!

领优惠券(最高得80元）

"背景建模是计算机视觉领域中的一个重要概念，主要应用于从静态摄像头视频中检测移动物体。背景建模通过创建一个表示静态环境的模型，然后对比这个模型与实际帧来识别出移动的前景物体。混合高斯模型（Mixture of Gaussians, MOG）是一种广泛应用的背景建模方法，由Stauffer和Grimson最早提出。在MOG模型中，背景被视为不同高斯分布的混合，每个像素值被看作是这些分布的概率密度函数的线性组合。这种方法的优点在于它能够处理光照变化、阴影和小范围的背景运动等复杂情况。然而，原始的MOG模型也存在一些问题，如对快速运动、新物体引入、背景更新不及时等问题的处理不够理想。论文《Background Modeling using Mixture of Gaussians for Foreground Detection - A Survey》对近年来针对MOG模型的改进进行了全面的调查和分类。作者Bouwmans、El Baf和Vachon分析了原始MOG的工作原理，并讨论了在处理视频序列时遇到的挑战。他们将改进方法分为几个类别，根据策略的不同来改善原始MOG模型，同时也针对它们各自声称能处理的困难场景进行了讨论。这些改进策略包括但不限于： 1. 更高效的背景初始化和更新机制，以适应环境变化。 2. 考虑时间上下文，用更复杂的统计模型来捕捉长时间的背景变化。 3. 使用更精细的高斯混合模型，例如增加高斯分量的数量或动态调整分量权重。 4. 引入自适应阈值，以适应光照变化和背景噪声。 5. 应对阴影和反射，避免将它们误判为前景。通过对这些策略的分析，论文指出了每种方法的局限性，并提出了未来研究的一些可能方向，包括但不限于更加高效的计算方法、更好的背景建模适应性以及对动态背景的处理。关键词：背景建模、前景检测、混合高斯模型" 这篇摘要详细介绍了背景建模中的混合高斯方法及其改进，对于初学者来说是一个很好的入门资料，有助于理解如何通过这种统计模型来区分视频中的前景和背景。

资源详情

资源推荐

Table 1. Challenges and MOG Versions

Critical Situations References

Noise Image (NI) [61 -64]

Camera jitter (CJ) [58, 62, 65, 66]

Camera Adjustements (CA)

Auto Gain Control [67]

Auto White Balance [68]

Automatic Exposure Correction [69]

Gradual Illumination Changes (TD) [1, 24, 38, 63, 70-73, 74, 75]

Sudden Illumination Changes (LS) [24, 61, 63, 65, 67, 68, 70, 71,

74-81]

Bootstrapping during initialization

(B)

[59, 82, 83]

Bootstrapping during running (B) [84 -88]

Camouflage (C) [42, 72, 73, 88-92]

Foreground Aperture (FA) [93]

Moved background objects (MBO) [60, 63, 70, 74, 75, 80 ,85, 87,

88]

Inserted background objects (IBO) [60, 63, 70, 74, 75, 85, 87, 88]

Multimodal background (MB) [1, 61, 64, 84, 86, 90, 94-107]

Waking foreground object (WFO) [74 -75, 80, 85, 87, 88]

Sleeping foreground objects (SFO) [1, 42, 60,74,75,78,79, 85, 87,

88, 108-114]

Shadows and highlights (S) [61, 62, 68-70, 81, 101, 109,

115-123]

Table 2. Real Time Constraints and MOG Versions

Real-Time Constraints References

Computation Time (CT) [24, 43, 92,124-131]

Memory Requirement (MR) [127, 128]

time. To solve this problem, Zivkovic [94] proposes an

online algorithm that estimates the parameters of the MOG

and simultaneously selects the number of Gaussians using

the Dirichlet prior. The consequence is that K is dynamically

adapted to the multimodality of each pixel. In the same idea,

Cheng et al. [95] propose a stochastic approximation

procedure which is used to recursively estimate the

parameters of MOG and obtains the asymptotically optimal

number of Gaussians. Another approach proposed by

Shimada et al. [96] consists in a dynamic control of the

number of Gaussians. This approach automatically changes

the number of Gaussians in each pixel. The number of

Gaussians increases when pixel values often change. On the

other hand, when pixel values are constant in a while, some

Gaussians are eliminated or integrated. Another idea

proposed by Tan et al. [97] consists in a modified online EM

procedure to construct an adaptive MOG in which the

number K can adaptively reflect the complexity of pattern at

the pixel. Carminati et al. [98] estimate the optimal number

of K Gaussians for each pixel in a training set using an

ISODATA algorithm. This method is less adaptive than the

others because K isn’t updated after the training period.

3.2. Initialization of the Weight, the Mean and the

Variance

Stauffer and Grimson [1] initialized the weight, the mean

and the variance of each Gaussian using a K-means

algorithm. A training sequence without foreground is

needed. This initialization scheme is improved as follows:

By using another algorithm for the initialization:

Pavlidis et al. [99] show that an EM algorithm [51] is a

superior initialization method that provides fast learning and

exceptional stability to the foreground detection. This is

especially true when initialization happens during challen-

ging weather conditions like fast moving clouds or other

cases of multimodal background (MB). The disadvantage is

that the EM algorithm is computationally intensive. In the

continuity, Lee [84] proposes an approximation of the EM

algorithm to avoid unnecessary computation or storage. His

results on both synthetic data and surveillance videos show

better learning efficiency and robustness in case of (B) and

(MB) than the algorithm used by Friedman and Russel [50],

Stauffer and Grimson [1], and Bowden et al. [132].

By allowing presence of foreground objects in the

training sequence: Following the assumption that the

background’s pixels appear in the image sequence with the

maximum frequency, Zhang et al. [60] propose a

background reconstruction algorithm to initialize the MOG

even in presence of foreground in the scene. Another

approach proposed by Amintoosi et al. [82] consists in a QR-

decomposition based algorithm. To be more robust when

large parts of the background are occluded by moving

objects and parts of the background are never seen, Lepisk

[83] proposes to use the optic flow to reason about if the

background has been seen or not. This method is more robust

in the case of bootstrapping (B).

3.3. Maintenance of the Weight, the Mean and the

Variance

Stauffer and Grimson [1] updated the weight, the mean

and the variance of each Gaussians with an IIR filter using a

constant learning rate

for the weight update and a

learning rate

for the mean and variance update. This

maintenance scheme is optimized in the literature through

three different ways:

(1) Maintenance Rules: The update of the parameters in

Stauffer and Grimson [1] is made using an IIR filter like

shown in the Equation (6). The disadvantage is that it is

necessary to choose using a training sequence the learning

rate

which is then fixed for all the sequence. To improve

the robustness and sensitively to gradual illumination chan-

ges (TD), Han and Lin [38] update the MOG via adaptive

Kalman filtering. The main interest is that the Kalman filter

proposed adjusts its gain depending on the normalized

hal-00338206, version 1 - 12 Nov 2008

剩余18页未读，继续阅读

u010701520

粉丝: 0
资源: 1

高斯混合模型在前景检测中的背景建模综述

从局部信息推测基恩士的Removing BackGround Information算法的实现。.doc

Combining background information and a top-down model for computing salient objects

Describe the background information of GMAW process and metal transfer image in detail

Describe the background information of Significance of analyzing metal-transfer images for quality control and process optimization in detail

how to wite an essay to report bar charts

current license file does not support the EPCTC device

out0['厂家交流'].loc[out1.index[i]]=1

submitHandler()

u8 RFRead[16]

got an unexpected keyword argument 'autopact'

matlab get_background

QMessageBox::information 设置样式

the research background of brainwaves

设置QMessageBox的文字颜色和表头上的图标，通过样式表

class="screen__background__shape screen__background__shape4

QMessageBox::information设置背景

最新资源

class="screenbackgroundshape screenbackgroundshape4