交互式图像分割：GrabCut 迭代图割算法

需积分: 10 181 浏览量更新于2024-09-22 1 收藏 6.17MB PDF 举报

"GrabCut是一种交互式的图像前景提取算法，它结合了纹理（颜色）信息和边缘（对比度）信息，通过图割优化方法实现高效、准确的图像分割。该算法在用户交互方面进行了改进，降低了对用户输入的要求，并且引入了边框贴图技术来估算对象周围的Alpha遮罩，从而实现更精确的前景提取。" GrabCut算法是计算机视觉领域中的一个关键技术，主要用于图像编辑中的前景与背景分割。传统图像分割工具如Magic Wand依赖于纹理信息，而Intelligent Scissors则侧重于边缘信息。GrabCut算法则将两者结合起来，提供了一种更为综合的方法。在GrabCut算法的核心，是基于图割（Graph Cut）的优化过程。图割是一种优化技术，它将图像中的像素组织成一个图结构，其中节点代表像素，边代表像素之间的相似性或连接关系。权重通常由颜色、纹理和位置等特征决定。然后，图割算法会寻找最小割，将图像分割为两个部分，对应于前景和背景。在原始的GrabCut算法基础上，本研究提出了一个增强的迭代版本。这个迭代优化过程允许算法逐步精炼分割结果，通过多次切割和调整，提高分割的准确性和细致程度。这意味着用户不必一次性提供非常精确的输入，算法自身能通过迭代学习逐步完善分割效果。此外，GrabCut算法的一个显著改进是简化用户交互。用户只需大致围绕目标物体画一个矩形框，算法就能自动识别并提取出目标。这大大降低了用户的工作量，尤其对于非专业用户来说，这样的交互方式更加友好。最后，为了提高边缘处理的准确性，算法引入了边框贴图（Border Matting）技术。边框贴图是一种估计对象边界处混合颜色的技术，它可以生成一个Alpha遮罩，Alpha值表示像素属于前景或背景的程度。这种方法使得分割出的前景物体边缘更加平滑，避免了常见的锯齿现象，提高了整体的视觉效果。 GrabCut算法通过迭代优化、简化用户交互和引入边框贴图技术，极大地提升了图像分割的效率和质量，使其成为交互式图像编辑领域的一个强大工具。在实际应用中，例如图像合成、视频剪辑和虚拟现实等领域，GrabCut都有广泛的应用价值。

“GrabCut” — Interactive Foreground Extraction using Iterated Graph Cuts

Carsten Rother

∗

Vladimir Kolmogorov

†

Microsoft Research Cambridge, UK

Andrew Blake

‡

Figure 1: Three examples of GrabCut. The user drags a rectangle loosely around an object. The object is then extracted automatically.

Abstract

The problem of efﬁcient, interactive foreground/background seg-

mentation in still images is of great practical importance in im-

age editing. Classical image segmentation tools use either texture

(colour) information, e.g. Magic Wand, or edge (contrast) infor-

mation, e.g. Intelligent Scissors. Recently, an approach based on

optimization by graph-cut has been developed which successfully

combines both types of information. In this paper we extend the

graph-cut approach in three respects. First, we have developed a

more powerful, iterative version of the optimisation. Secondly, the

power of the iterative algorithm is used to simplify substantially the

user interaction needed for a given quality of result. Thirdly, a ro-

bust algorithm for “border matting” has been developed to estimate

simultaneously the alpha-matte around an object boundary and the

colours of foreground pixels. We show that for moderately difﬁcult

examples the proposed method outperforms competitive tools.

CR Categories: I.3.3 [Computer Graphics]: Picture/Image

Generation—Display algorithms; I.3.6 [Computer Graphics]:

Methodology and Techniques—Interaction techniques; I.4.6 [Im-

age Processing and Computer Vision]: Segmentation—Pixel clas-

siﬁcation; partitioning

Keywords: Interactive Image Segmentation, Graph Cuts, Image

Editing, Foreground extraction, Alpha Matting

1 Introduction

This paper addresses the problem of efﬁcient, interactive extrac-

tion of a foreground object in a complex environment whose back-

ground cannot be trivially subtracted. The resulting foreground ob-

ject is an alpha-matte which reﬂects the proportion of foreground

and background. The aim is to achieve high performance at the

cost of only modest interactive effort on the part of the user. High

performance in this task includes: accurate segmentation of object

from background; subjectively convincing alpha values, in response

to blur, mixed pixels and transparency; clean foreground colour,

∗

e-mail: carrot@microsoft.com

†

e-mail: vnk@microsoft.com

‡

e-mail: ablake@microsoft.com

free of colour bleeding from the source background. In general,

degrees of interactive effort range from editing individual pixels, at

the labour-intensive extreme, to merely touching foreground and/or

background in a few locations.

1.1 Previous approaches to interactive matting

In the following we describe brieﬂy and compare several state of

the art interactive tools for segmentation: Magic Wand, Intelligent

Scissors, Graph Cut and Level Sets and for matting: Bayes Matting

and Knockout. Fig. 2 shows their results on a matting task, together

with degree of user interaction required to achieve those results.

Magic Wand starts with a user-speciﬁed point or region to com-

pute a region of connected pixels such that all the selected pixels

fall within some adjustable tolerance of the colour statistics of the

speciﬁed region. While the user interface is straightforward, ﬁnding

the correct tolerance level is often cumbersome and sometimes im-

possible. Fig. 2a shows the result using Magic Wand from Adobe

Photoshop 7 [Adobe Systems Incorp. 2002]. Because the distri-

bution in colour space of foreground and background pixels have a

considerable overlap, a satisfactory segmentation is not achieved.

Intelligent Scissors (a.k.a. Live Wire or Magnetic Lasso)

[Mortensen and Barrett 1995] allows a user to choose a “minimum

cost contour” by roughly tracing the object’s boundary with the

mouse. As the mouse moves, the minimum cost path from the cur-

sor position back to the last “seed” point is shown. If the computed

path deviates from the desired one, additional user-speciﬁed “seed”

points are necessary. In ﬁg. 2b the Magnetic Lasso of Photoshop 7

was used. The main limitation of this tool is apparent: for highly

texture (or un-textured) regions many alternative “minimal” paths

exist. Therefore many user interactions (here 19) were necessary to

obtain a satisfactory result. Snakes or Active Contours are a related

approach for automatic reﬁnement of a lasso [Kass et al. 1987].

Bayes matting models colour distributions probabilistically to

achieve full alpha mattes [Chuang et al. 2001] which is based on

[Ruzon and Tomasi 2000]. The user speciﬁes a “trimap” T =

} in which background and foreground regions T

and

are marked, and alpha values are computed over the remain-

ing region T

. High quality mattes can often be obtained (ﬁg.

2c), but only when the T

region is not too large and the back-

ground/foreground colour distributions are sufﬁciently well sepa-

rated. A considerable degree of user interaction is required to con-

struct an internal and an external path.

Knockout 2 [Corel Corporation 2002] is a proprietary plug-in for

Photoshop which is driven from a user-deﬁned trimap, like Bayes

matting, and its results are sometimes similar (ﬁg. 2d), sometimes

of less quality according to [Chuang et al. 2001].

下载后可阅读完整内容，剩余5页未读，立即下载

uglyfight

粉丝: 0

交互式图像分割：GrabCut 迭代图割算法

GrabCut -Interactive Foreground Extraction using Iterated Graph Cuts

“GrabCut” — Interactive Foreground Extraction using Iterated Graph Cuts

前景提取matlab代码-GrabCut:GrabCut：使用迭代图切割的交互式前景提取

图像分割 grabcut C++版本的源码，包含max-flow源码

MSRM Based Object Extraction Method for Image Sequences.pdf

GrabCut算法：从图像分割到前景提取

C++ 实现新年倒计时与烟花显示效果的图形界面程序

儿歌、手指谣、律动.docx

基于Msp430设计的环境监测系统（完整系统源码等资料）实物仿真.zip

基于COMSOL仿真的电磁超声压电接收技术在铝板裂纹检测中的应用研究,COMSOL模拟：电磁超声压电接收技术在铝板裂纹检测中的应用,comsol电磁超声压电接收EMAT 在1mm厚铝板中激励250kH

最新资源