Mask-Guided Portrait Editing with Conditional GANs
Shuyang Gu¹   Jianmin Bao¹   Hao Yang²   Dong Chen²   Fang Wen²   Lu Yuan²
¹University of Science and Technology of China   ²Microsoft Research
{gsy777,jmbao}@mail.ustc.edu.cn   {haya,doch,fangwen,luyuan}@microsoft.com
Figure 1: We propose a framework based on conditional GANs for mask-guided portrait editing. (a) Mask2image: our framework can generate diverse and realistic faces from a single input target mask (lower-left corner of the first image). (b) Component editing: our framework allows editing the mask to change the shape of face components, e.g., mouth, eyes, and hair. (c) Component transfer: our framework also allows transferring the appearance of each component of a portrait, including hair color.
Abstract
Portrait editing is a popular subject in photo manipulation. Generative Adversarial Networks (GANs) have advanced the synthesis of realistic faces and enabled richer face editing. In this paper, we identify three issues in existing techniques: diversity, quality, and controllability in portrait synthesis and editing. To address these issues, we propose a novel end-to-end learning framework that generates faces with conditional GANs guided by provided face masks. The framework learns a feature embedding for each face component (e.g., mouth, hair, eye) separately, contributing to better correspondences for image translation and to local face editing. With the mask, our network supports many applications, such as mask-driven face synthesis, face Swap+ (which includes hair in the swap), and local manipulation. It can also modestly boost face parsing performance when used for data augmentation.
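As an illustration of the per-component embedding idea described above, the following is a minimal PyTorch sketch, not the released implementation: each face component is masked out of the input image and encoded by its own small encoder. The component label set, encoder architecture, and embedding dimension are illustrative assumptions.

import torch
import torch.nn as nn

# Assumed component label set; the actual parsing labels may differ.
COMPONENTS = ["skin", "hair", "left_eye", "right_eye", "mouth"]

class ComponentEncoders(nn.Module):
    """One small convolutional encoder per face component (illustrative)."""
    def __init__(self, embed_dim=128):
        super().__init__()
        self.encoders = nn.ModuleDict({
            name: nn.Sequential(
                nn.Conv2d(3, 32, 4, stride=2, padding=1), nn.ReLU(),
                nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                nn.Linear(64, embed_dim),
            )
            for name in COMPONENTS
        })

    def forward(self, image, mask):
        # image: (B, 3, H, W) portrait; mask: (B, H, W) integer parsing map
        # where label i marks the pixels of COMPONENTS[i] (an assumption).
        embeddings = {}
        for i, name in enumerate(COMPONENTS):
            region = image * (mask == i).unsqueeze(1).float()  # keep only this component
            embeddings[name] = self.encoders[name](region)
        return embeddings  # one (B, embed_dim) embedding per component

Encoding components separately in this way is what allows a single component (e.g., the mouth) to be edited or transferred without disturbing the rest of the face.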
1. Introduction
Portrait editing is of great interest in the vision and graphics communities due to its potential applications in movies, gaming, and photo manipulation and sharing. People enjoy effects that make faces look more interesting, funny, and beautiful, as offered by many popular apps such as Snapchat and Facetune.
Recently, Generative Adversarial Networks (GANs) [16] have made tremendous progress in synthesizing realistic faces [1, 29, 25, 12], enabling applications such as face aging [46], pose change [44, 21], and attribute modification [4]. However, these existing approaches still suffer from quality issues, such as a lack of fine skin detail, difficulty in handling hair, and blurred backgrounds. Such artifacts make the generated faces look unrealistic.
To address these issues, one possible solution is to use a facial mask to guide generation. On one hand, a face mask provides a strong geometric constraint, which helps synthesize realistic faces. On the other hand, an accurate contour for each facial component (e.g., eye, mouth, hair) is necessary for local editing. Based on face masks, some works [40, 14] achieve very promising results in portrait stylization. However, these methods focus on transferring visual style (e.g., B&W, color, painting) from a reference face to a target face; they are not suited to synthesizing different faces or changing face components.
Some GAN models have begun to integrate face masks or skeletons for better image-to-image translation, for