Learning to Generate and Edit Hairstyles
Weidong Yin¹, Yanwei Fu¹*, Yiqing Ma¹, Yu-Gang Jiang², Tao Xiang³, Xiangyang Xue¹,²
¹School of Data Science, Fudan University; ²School of Computer Science, Fudan University; ³Queen Mary University of London
ABSTRACT
Modeling hairstyles for classification, synthesis and image editing has many practical applications. However, existing hairstyle datasets, such as the Beauty e-Expert dataset, are too small for developing and evaluating computer vision models, especially recent deep generative models such as the generative adversarial network (GAN). In this paper, we contribute a new large-scale hairstyle dataset called Hairstyle30k, which is composed of 30k images containing 64 different types of hairstyles. To enable automated generation and modification of hairstyles in images, we also propose a novel GAN model termed Hairstyle GAN (H-GAN) which can be learned efficiently. Extensive experiments on the new dataset as well as existing benchmark datasets demonstrate the effectiveness of the proposed H-GAN model.
KEYWORDS
Hairstyle Dataset, Hairstyle Classification, Generative Adversarial Networks
ACM Reference format:
Weidong Yin¹, Yanwei Fu¹, Yiqing Ma¹, Yu-Gang Jiang², Tao Xiang³, Xiangyang Xue¹,². 2017. Learning to Generate and Edit Hairstyles. In Proceedings of MM'17, October 23–27, 2017, Mountain View, CA, USA, 9 pages.
DOI: https://doi.org/10.1145/3123266.3123423
1 INTRODUCTION
Hairstyle can express one's personality, self-confidence, and attitude. It is thus an important aspect of personal appearance. A computer vision model that enables recognition, synthesis, and modification of hairstyles in images is of great practical use. For example, with such a model, a customer could take a photo of him/herself and synthesize different hairstyles before going to the hairdresser's to make the most satisfactory one a reality. In addition, an automated hairstyle recognition model can be used to recognize a person's identity in security applications.
Existing efforts on hairstyle modeling have focused on recommending the most suitable hairstyles [18] or on interactive editing by users [7, 22, 32]. However, there is no attempt so far to systematically study hairstyles in images, and no model is available that can address various hairstyle modeling tasks in a comprehensive manner.
∗Dr. Yanwei Fu is the corresponding author. Email: yanweifu@fudan.edu.cn
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.
MM'17, October 23–27, 2017, Mountain View, CA, USA.
© 2017 ACM. ISBN 978-1-4503-4906-2/17/10...$15.00
DOI: https://doi.org/10.1145/3123266.3123423
One of the reasons is that hairstyles exhibit large variations, and modeling these variations requires large-scale datasets. Unfortunately, such a large-scale hairstyle dataset does not exist. In the multimedia and computer vision communities, hairstyles are often labeled as attributes in face datasets. However, such annotation is often crude, focusing mostly on hair length and color. On the other hand, existing specialized hairstyle datasets such as the Beauty e-Expert dataset [18] are too small to represent the diversity of human hairstyles in the wild.
In this paper, we introduce the first large-scale hairstyle dataset, Hairstyle30k, to the community and hope that it will greatly boost research into hairstyle modeling. Images in the dataset (see Fig. 1 for examples) are collected from the Web via search engines using keywords corresponding to a hairstyle ontology. This results in 64 different types of hairstyles in 30k images. On average, each hairstyle class has around 480 images. The newly proposed dataset is used to train the H-GAN model proposed in this paper. Importantly, with 64 hairstyle classes, this is a fine-grained dataset presenting a challenging recognition task, as verified by our experiments.
Apart from releasing a new dataset, we also present a Hairstyle Generative Adversarial Network (H-GAN) model for automatically generating or modifying/editing hairstyles given an input image. Our H-GAN has three components: an encoder-decoder sub-network, a GAN, and a recognition sub-network. In particular, the encoder-decoder sub-network is a variant of the Variational Auto-Encoder (VAE) [12], and the recognition sub-network shares the same layers as the GAN discriminator, as in InfoGAN [5]. The model is unique in that, once trained, it can be used to perform various tasks including recognition, synthesis, and modification. Extensive experiments with our H-GAN algorithm on the proposed dataset and other general-purpose benchmark datasets validate the efficacy of our model.
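To make the three-component design concrete, the sketch below wires up a VAE-style encoder (using the reparameterization trick), a hairstyle-label-conditioned decoder/generator, and a discriminator whose shared trunk also feeds an InfoGAN-style recognition head. This is a minimal illustrative sketch only: the layer sizes, the 64-dimensional latent code, the flattened image vectors, and the random-weight `linear` helper are our assumptions, not the paper's actual (convolutional, trained) architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

def linear(x, out_dim):
    """Random-weight linear layer, a stand-in for a trained layer."""
    W = rng.standard_normal((x.shape[-1], out_dim)) * 0.01
    return x @ W

def encode(image_vec, z_dim=64):
    """VAE-style encoder: map an image vector to a latent Gaussian."""
    h = np.tanh(linear(image_vec, 256))
    mu, log_var = linear(h, z_dim), linear(h, z_dim)
    # Reparameterization trick: z = mu + sigma * eps, eps ~ N(0, I)
    z = mu + np.exp(0.5 * log_var) * rng.standard_normal(z_dim)
    return z, mu, log_var

def decode(z, hairstyle_onehot, img_dim=1024):
    """Decoder/generator: latent code + hairstyle label -> image vector."""
    h = np.tanh(linear(np.concatenate([z, hairstyle_onehot]), 256))
    return np.tanh(linear(h, img_dim))

def discriminate_and_recognize(image_vec, num_classes=64):
    """A shared trunk feeds both the real/fake head and the
    hairstyle-recognition head, as in InfoGAN."""
    shared = np.tanh(linear(image_vec, 256))
    real_fake = 1.0 / (1.0 + np.exp(-linear(shared, 1)))  # sigmoid
    logits = linear(shared, num_classes)
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()                                  # softmax
    return real_fake, probs

# Hairstyle editing: encode an image, then decode it with a new label.
x = rng.standard_normal(1024)
z, mu, log_var = encode(x)
label = np.eye(64)[10]        # target hairstyle class (one-hot)
x_edit = decode(z, label)
d, cls = discriminate_and_recognize(x_edit)
```

At training time the three parts would be optimized jointly (VAE reconstruction/KL losses, adversarial loss, and classification loss); here only the forward passes are shown to illustrate the data flow.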
Contributions. We make several contributions in this paper. Firstly, to study hairstyle-related problems, we contribute a new large-scale hairstyle dataset, Hairstyle30k, to the community. To the best of our knowledge, this is the largest hairstyle dataset, especially in terms of the number of hairstyle classes. Secondly, we present a new deep generative model, H-GAN, which can effectively and efficiently generate and modify the hairstyles in person images. Extensive experiments demonstrate that our H-GAN is superior to a number of state-of-the-art alternative models.
2 RELATED WORK
2.1 Image Editing and Synthesis
Editing images with interaction. Recent advances in interactive image segmentation have significantly simplified the tasks of object