背景字典学习的运动物体检测算法

PDF格式 | 553KB | 更新于2024-08-27 | 48 浏览量 | 举报

"基于背景字典的运动物体检测" 在计算机视觉领域，运动物体检测是关键任务之一，它广泛应用于视频监控、自动驾驶、无人机导航等多个场景。传统的运动物体检测方法，如高斯混合模型（GMM），虽然在某些情况下表现良好，但其处理方式存在局限性。GMM通过对每个像素进行处理，容易受到图像噪声的影响，同时计算复杂度较高，这在处理实时视频流时可能会成为瓶颈。针对这些问题，本文提出了一种创新的算法——基于背景字典的运动物体检测。该算法的核心思想是将每一帧图像均匀划分为多个图像片段（或称为图像块），每个片段可能包含背景或者运动物体。这种分割方法有助于减小噪声对检测结果的影响，因为较大的图像块能更好地平均噪声。接下来，对于每个图像片段，构建一个背景字典。这个背景字典是根据历史帧中的信息学习得到的，它包含了该区域在正常情况下的典型特征。通过比较当前图像片段与背景字典中的特征，可以判断该片段是属于背景还是运动物体。相似度计算通常采用距离度量，如欧氏距离或余弦相似度，以确定片段与背景的匹配程度。文章进一步提出了背景字典的动态更新策略。在视频序列中，环境可能会发生变化，因此背景字典需要能够适应这些变化。更新规则确保了背景字典的实时性和准确性，即使在光照变化、阴影移动等复杂情况下也能有效地检测运动物体。实验结果验证了该算法的有效性和鲁棒性。与传统的GMM方法相比，基于背景字典的算法在减少噪声干扰、降低计算复杂度以及提高检测精度方面表现出显著优势。关键词包括：运动物体检测、高斯混合模型、背景字典、相似度测量和动态更新。这项研究为运动物体检测提供了一个新的视角，利用背景字典学习和更新策略，提高了在复杂环境下的检测性能。这种方法对于实际应用，尤其是需要高效、准确运动物体检测的系统，具有重要的理论和实践价值。

MOVING OBJECT DETECTION BASED ON BACKGROUND DICTIONARY

Hua-sheng Zhu, Jun Wang, Chen-guang Xu, and Jun Ye

School of Information Engineering, Nanchang Institute of Technology, Nanchang 330099, China

ABSTRACT

Gaussian Mixture Model (GMM) and its variations

process images by per pixel, so they may be corrupted by

noises and the computational cost is high. In this paper,

we propose a robust moving object detection algorithm

with a background dictionary learning. To do this, we first

divide an image into multiple image patches that have the

same sizes. Each patch is the object or background. Then,

A background dictionary is learnt for each patch. The

similarity between a patch and the background dictionary

is measured, upon which a patch is distinguished between

the object and the background. Additionally, in order to

adapt the dynamic contexts across in a video sequence, a

robust background dictionary updating scheme is

proposed. Experimental results demonstrate the

effectiveness and robustness of the proposed detection

algorithm.

Key Words — Moving object detection; Gaussian

mixture model; background dictionary; similarity

1. INTRODUCTION

Moving object detection is a hot research topic in

computer vision. Generally speaking, moving object

detection can be broadly categorized into three groups,

namely optical flow method [1-3], frame subtraction

method [4-5], and background subtraction method [6-8].

The optical flow is the pattern of apparent motion of

objects, surfaces, and edges in a visual scene caused by

the relative motion between an observer and the scene.

The optical flow method is susceptible to be interfered by

noises and the computational cost is high. The frame

subtraction method captures a moving object by com-

puting the differences between two adjacent frames. This

method has a low computational cost, and it is robust to

illumination variations. However, the frame subtraction

method is not stable due to the moving speed variations,

and it is not able to capture the whole outline. The

background subtraction method compares the intensity

between the current image and the corresponding

backgrounds. Because of the robustness for the moving

object detection, the background subtraction method is

widely applied. Modeling a background model is critical

before segmenting a moving object. Stauffer et al. [9]

proposes a background modeling based on the Gaussian

Mixture Model (GMM). GMM is widely applied in

moving object detection for videos. By updating really

the background model, GMM can efficiently overcome

the small perturbations caused by the dynamic

background and the noises caused by camera shaking.

However, GMM is not robust to the influence caused by

severe illumination variations. The improving variations

[10-12] of GMM are proposed, however, these methods

process an image by per pixel, upon which the probability

density is computed. So the computational cost for GMM

is high, and it is not robust to noises. Recently, the DPM

based object detection algorithm[13] and the deformation

dictionaries based object detection algorithm[14] are

proposed.

In this paper, we propose a robust moving object

detection method based on a learnt background dictionary.

For an image, it is divided into multiple image patches

and the similarity between each patch and the

corresponding dictionary is computed. Then the patch is

distinguished as an object or background based on the

similarity. The proposed method is robust to illumination

variation, and the computational cost is low.

2. BACKGROUND DICTIONARY BASED MOVING

OBJECT DETECTION

In this section, we describe the details of the proposed

method including: the framework of the proposed method,

the dictionary initializing and dictionary updating.

2.1. The Framework Of The Proposed Method

The framework of the proposed moving object detection

is illustrated in Fig. 1. It consists of four major

components: the video input, the background dictionary

initializing, background dictionary updating, moving

object detection.

A video are composed of many frames:

{, ,...,},

Vff f= (1)

where V is a video, f

is the i

frame.

Each frame consists of the background and moving

object:

bg=+ (2)

where b is the background of the current frame image, g is

the moving object.

Each frame can be divided into several image patches

that have a same size as:

{

}

,,, ,

fpp p R

=∈

(3)

下载后可阅读完整内容，剩余4页未读，立即下载

weixin_38735899

粉丝: 2

背景字典学习的运动物体检测算法

基于字典学习的机织物瑕疵自动检测研究

论文研究-基于在线字典学习的管道微弱泄漏检测方法.pdf

基于稀疏表达残差的自然场景运动目标检测

电信设备-基于运动信息及稀疏投影的视频目标自动检测跟踪方法.zip

基于稀疏表示与字典学习的背景差分算法优化

利用OpenCV和背景Codebook模型进行高效前景检测

视觉运动目标检测：码本建模算法研究与进展

分层匹配五元组Codebook算法在运动目标检测中的应用

自适应背景建模方法：实现高效物体识别与跟踪

SIFT流与稀疏表示：应对突然运动跟踪的自适应样本选择

最新资源