MSRM Based Object Extraction Method for Image Sequences
Wenting Yu¹, Jingling Wang², Long Ye³
¹,² Key Laboratory of Media Audio & Video, Ministry of Education, Communication University of China, Chaoyang District, 100024, Beijing, P.R. China
ᵃ email: ywt_happy@sina.com
Keywords: Object extraction, SLIC superpixels, MSRM
Abstract. Object extraction, which aims to accurately separate a foreground object from its background in still images, plays an important role in many computer vision applications. This paper presents an interactive object extraction method for image sequences based on MSRM (maximal similarity based region merging). The user manually marks the target and background only once, on any single image of the sequence, to obtain the extraction result for the whole sequence. Compared with the commonly used graph-cut-based approach, in which the target and background must be marked on every image one by one, our method is more efficient while producing results as accurate as those of other methods.
Introduction
Accurately separating a foreground object from its background plays an important role in many computer vision applications. Image segmentation aims to separate the desired objects from the background, and various methods have been proposed in recent years. For example, in [1, 2], Li et al. combined graph cut with a watershed pre-segmentation for better segmentation outputs, where the regions produced by the watershed, instead of the pixels of the original image, are regarded as the nodes of the graph cut. Building on graph cut, many methods have been proposed, such as GrabCut and Lazy Snapping. Jifeng Ning and Lei Zhang proposed MSRM [3], a novel interactive region merging method based on an initial mean shift segmentation.
GrabCut [4] extends graph cut to color images and incomplete trimaps, and consists of two parts: automatic hard segmentation and border matting. The idea of the automatic segmentation is to build a graph in which each node corresponds to a pixel, so that a max-flow/min-cut algorithm solves the segmentation iteratively. The inclusion of color information through Gaussian Mixture Models in the graph cut algorithm increases its robustness.
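As an illustration, this iterative procedure is available in OpenCV as cv2.grabCut; the following Python sketch shows how the GMM models are initialized from a user rectangle and refined over a few iterations. The input file name and rectangle coordinates are assumptions for illustration only.

import cv2
import numpy as np

img = cv2.imread("frame_001.png")            # hypothetical input frame
mask = np.zeros(img.shape[:2], np.uint8)     # per-pixel labels filled by grabCut
rect = (50, 50, 300, 400)                    # assumed user rectangle (x, y, w, h)
bgdModel = np.zeros((1, 65), np.float64)     # background GMM parameters
fgdModel = np.zeros((1, 65), np.float64)     # foreground GMM parameters

# Five iterations alternating GMM fitting and min-cut segmentation.
cv2.grabCut(img, mask, rect, bgdModel, fgdModel, 5, cv2.GC_INIT_WITH_RECT)

# Definite and probable foreground labels form the extracted object.
fg = np.where((mask == cv2.GC_FGD) | (mask == cv2.GC_PR_FGD), 1, 0).astype(np.uint8)
extracted = img * fg[:, :, None]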
Lazy Snapping [1] also separates object extraction into two tasks: object context specification and boundary refinement. Graph cut [5] is used in both tasks. To specify an object in a given image, the user marks a few lines on the image by dragging the mouse cursor while holding a button (left button indicating the foreground, right button the background). As the authors admitted, the object marking step and the boundary editing step have not been combined in a unified way.
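Lazy Snapping's original implementation is not described here in code, but the underlying graph-cut step can be illustrated with the third-party PyMaxflow library. In the sketch below, the unary costs, grid size, and smoothness weight are all assumptions for illustration, standing in for the likelihoods derived from the user's foreground and background strokes.

import numpy as np
import maxflow

# Hypothetical unary costs, e.g. negative log-likelihoods of each pixel's
# colour under models fitted to the foreground and background strokes.
fg_cost = np.random.rand(120, 160)
bg_cost = np.random.rand(120, 160)

g = maxflow.Graph[float]()
nodes = g.add_grid_nodes(fg_cost.shape)
g.add_grid_edges(nodes, 0.5)                 # smoothness term between neighbours
g.add_grid_tedges(nodes, fg_cost, bg_cost)   # data terms to the two terminals

g.maxflow()                                  # solve the min-cut
labels = g.get_grid_segments(nodes)          # boolean segmentation map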
In the MSRM [3] method, the interactive information is introduced as markers, which are input by the user to roughly indicate the position and main features of the object and the background. The method then calculates the similarity of different regions and merges them according to the proposed maximal similarity rule with the help of these markers. The object is extracted from the background when the merging process ends.
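For illustration, a minimal Python sketch of one merging pass follows. It assumes each region is described by a normalised colour histogram compared with the Bhattacharyya coefficient, as in [3]; the region bookkeeping (ids, adjacency sets) and the histogram update are simplifications, not the authors' implementation.

import numpy as np

def bhattacharyya(h_p, h_q):
    # Similarity of two normalised colour histograms, as used in MSRM.
    return np.sum(np.sqrt(h_p * h_q))

def merge_pass(hist, neighbors, background, unlabeled):
    # One merging pass: a background region b absorbs an adjacent
    # unlabeled region a only if, among all of a's neighbours, b has
    # the maximal similarity with a. Returns True if anything merged.
    merged = False
    for b in list(background):
        for a in list(neighbors[b] & unlabeled):
            best = max(neighbors[a],
                       key=lambda s: bhattacharyya(hist[a], hist[s]))
            if best != b:
                continue
            # Merge a into b: combine histograms (simplified here as a
            # renormalised sum) and transfer a's adjacency to b.
            hist[b] = hist[b] + hist[a]
            hist[b] /= hist[b].sum()
            for s in neighbors[a] - {b}:
                neighbors[s].discard(a)
                neighbors[s].add(b)
            neighbors[b] |= neighbors[a] - {a, b}
            neighbors[b].discard(a)
            unlabeled.discard(a)
            merged = True
    return merged

The pass is repeated until no region can be merged any more; the regions remaining outside the background then constitute the extracted object.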
Based on the above considerations, we propose a fast object extraction method for image sequences based on MSRM [3]. In this method, the user manually marks the target and background only once and can then extract the object from the whole image sequence. Compared with the commonly used graph-cut-based method, in which the target and background are marked on every image one by one, our method is more efficient and its results are as accurate as those of other methods. Experimental results on several kinds of color image sequences show the effectiveness and convenience of the approach.