Deep Image Matting
Ning Xu 1,2, Brian Price 3, Scott Cohen 3, and Thomas Huang 1,2
1 Beckman Institute for Advanced Science and Technology
2 University of Illinois at Urbana-Champaign
3 Adobe Research
{ningxu2,t-huang1}@illinois.edu, {bprice,scohen}@adobe.com
Abstract
Image matting is a fundamental computer vision problem and has many applications. Previous algorithms have poor performance when an image has similar foreground and background colors or complicated textures. The main reasons are that prior methods 1) only use low-level features and 2) lack high-level context. In this paper, we propose a novel deep learning based algorithm that can tackle both of these problems. Our deep model has two parts. The first part is a deep convolutional encoder-decoder network that takes an image and the corresponding trimap as inputs and predicts the alpha matte of the image. The second part is a small convolutional network that refines the alpha matte predictions of the first network to have more accurate alpha values and sharper edges. In addition, we create a large-scale image matting dataset including 49,300 training images and 1,000 testing images. We evaluate our algorithm on the image matting benchmark, our testing set, and a wide variety of real images. Experimental results clearly demonstrate the superiority of our algorithm over previous methods.
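As a rough sketch of the two-part model summarized above, the following PyTorch-style code (with illustrative layer sizes and names, assumed rather than taken from the paper) shows a 4-channel encoder-decoder that maps an RGB image plus trimap to a coarse alpha matte, followed by a small convolutional refinement network:

import torch
import torch.nn as nn

class EncoderDecoder(nn.Module):
    # Stage 1: coarse alpha from RGB image + trimap (4 input channels).
    # Widths and depths here are illustrative, not the paper's exact design.
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(4, 64, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(128, 256, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(256, 128, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 1, 3, padding=1), nn.Sigmoid(),  # alpha in [0, 1]
        )

    def forward(self, rgb, trimap):
        x = torch.cat([rgb, trimap], dim=1)   # B x 4 x H x W
        return self.decoder(self.encoder(x))

class RefinementNet(nn.Module):
    # Stage 2: small fully convolutional network that sharpens the coarse alpha.
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(4, 64, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 1, 3, padding=1),
        )

    def forward(self, rgb, coarse_alpha):
        x = torch.cat([rgb, coarse_alpha], dim=1)
        # Residual correction added to the coarse prediction, clamped to valid alphas.
        return (coarse_alpha + self.net(x)).clamp(0, 1)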
1. Introduction
Matting, the problem of accurate foreground estimation in images and videos, has significant practical importance. It is a key technology in image editing and film production, and effective natural image matting methods can greatly improve current professional workflows. This demands methods that can handle real-world images in unconstrained scenes.
Unfortunately, current matting approaches do not generalize well to typical everyday scenes. This is partially due to the difficulty of the problem: as formulated, the matting problem is underconstrained, with 7 unknown values per pixel but only 3 known values:

$$I_i = \alpha_i F_i + (1 - \alpha_i) B_i, \qquad \alpha_i \in [0, 1]. \tag{1}$$
where the RGB color at pixel $i$, $I_i$, is known, and the foreground color $F_i$, background color $B_i$, and matte estimate $\alpha_i$ are unknown. However, current methods are further limited by the way they approach the problem.
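As a minimal numerical illustration of this counting argument (the foreground, background, and alpha values below are invented for the example, not taken from the paper):

import numpy as np

# Per-pixel compositing (Eq. 1): the observed color is a convex blend of
# an unknown foreground and an unknown background.
F = np.array([0.9, 0.2, 0.1])    # unknown foreground RGB  (3 values)
B = np.array([0.1, 0.4, 0.8])    # unknown background RGB  (3 values)
alpha = 0.3                      # unknown matte value     (1 value)

I = alpha * F + (1.0 - alpha) * B   # the only observed quantity (3 values)
print(I)                            # [0.34 0.34 0.59]

# 3 + 3 + 1 = 7 unknowns per pixel versus the 3 known components of I,
# so Eq. 1 alone cannot be inverted without extra constraints (e.g. a trimap).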
The first limitation stems from current methods being designed to solve the matting equation (Eq. 1). This equation formulates matting as a linear combination of two colors, and consequently most current algorithms treat it largely as a color problem. The standard approaches include sampling foreground and background colors [3, 9], propagating alpha values according to the matting equation [14, 31, 22], or a hybrid of the two [32, 13, 28, 16]. Such approaches rely largely on color as the distinguishing feature (often along with the spatial positions of the pixels), making them highly sensitive to situations where the foreground and background color distributions overlap. Unfortunately, this is the common case in natural images, and it often leads to low-frequency “smearing” or high-frequency “chunky” artifacts, depending on the method (see Fig. 1, top row). Even the recently proposed deep learning methods are highly reliant on color-dependent propagation methods [8, 29].
A second limitation is due to the focus on a very small dataset. Generating ground truth for matting is very difficult, and the alphamatting.com dataset [25] made a significant contribution to matting research by providing ground-truth data. Unfortunately, it contains only 27 training images and 8 test images, most of which are objects in front of an image displayed on a monitor. Due to its size and the constraints of the dataset (e.g. indoor lab scenes, indoor lighting, no humans or animals), it is by its nature biased, and methods are incentivized to fit to this data for publication purposes. As is the case with all datasets, especially small ones, at some point methods will overfit to the dataset and no longer generalize to real scenes. A recent video matting dataset is available [10] with 3 training videos and 10 test videos, 5 of which were extracted from green screen footage and the