Fig. 2. A glimpse of the diverse range of architectures used for GAN-based deep generative semi-supervised methods. The characters "D", "G" and "E" represent Discriminator, Generator and Encoder, respectively. In Fig. 2(6), Localized GAN is equipped with a local generator G(x, z), so we use the yellow box to distinguish it. Similarly, in CT-GAN, the purple box is used to denote a discriminator that introduces a consistency constraint.
the standard GAN (Eq. (2)). The structure is illustrated in
Fig. 2(3). This method aims to learn a discriminator which
classifies the samples into K categories by assigning a label y to each x, instead of learning a binary discriminator value function. Moreover, the CatGAN discriminator loss function consists of three parts: (1) the conditional entropy H[p(y|x, D)], minimized to obtain confident category assignments for real samples; (2) the conditional entropy H[p(y|G(z), D)], maximized to keep predictions on generated samples uncertain; and (3) the marginal class entropy H[p(y|D)], maximized to encourage uniform usage of all classes. The
proposed framework uses the feature space learned by the
discriminator for the final learning task. For the labeled data,
the supervised loss is also a cross-entropy term between
the conditional distribution p(y|x, D) and the samples’ true
label distribution.
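To make these terms concrete, the following PyTorch-style sketch assembles the CatGAN discriminator objective under the assumption that discriminator(x) returns K class logits and generator(z) returns synthetic samples; the function and variable names are illustrative, not taken from the original implementation.

import torch
import torch.nn.functional as F

def entropy(p, eps=1e-8):
    # Shannon entropy of categorical distributions, averaged over the batch.
    return -(p * torch.log(p + eps)).sum(dim=1).mean()

def catgan_d_loss(discriminator, generator, x_lab, y_lab, x_unlab, z):
    p_real = F.softmax(discriminator(x_unlab), dim=1)       # p(y|x, D)
    p_fake = F.softmax(discriminator(generator(z)), dim=1)  # p(y|G(z), D)
    h_real = entropy(p_real)                                 # (1) minimize: confident predictions on real samples
    h_fake = entropy(p_fake)                                 # (2) maximize: uncertain predictions on generated samples
    h_marg = entropy(p_real.mean(dim=0, keepdim=True))       # (3) maximize: uniform usage of all classes
    ce_sup = F.cross_entropy(discriminator(x_lab), y_lab)    # supervised cross-entropy on labeled data
    return ce_sup + h_real - h_fake - h_marg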
CCGAN. Context-Conditional Generative Adversarial
Networks (CCGAN) [141] uses an adversarial loss to harness unlabeled image data through image in-painting. The architecture of CCGAN is shown in Fig. 2(4). The main highlight of this work is the context information provided by the surrounding parts of the image. The method trains a GAN in which the generator fills in the pixels within a missing hole, while the discriminator distinguishes the real unlabeled images from these in-painted
images. More formally, m ⊙ x is fed as input to the generator, where m denotes a binary mask that drops out a specified portion of an image and ⊙ denotes element-wise multiplication. Thus the in-painted image is x_I = (1 − m) ⊙ x_G + m ⊙ x, with the generator output x_G = G(m ⊙ x, z). The in-painted
examples provided by the generator cause the discrimina-
tor to learn features that generalize to the related task of
classifying objects. The penultimate layer of the discriminator is then shared with the classifier, whose cross-entropy loss is combined with the discriminator loss.
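As a rough illustration of the in-painting step above, the sketch below drops out a centered square region, lets the generator fill it from the masked context, and stitches the result back into the real surroundings; the mask shape and the generator interface are assumptions for illustration, not the configuration used in [141].

import torch

def sample_mask(batch, height, width, hole=32):
    # Binary mask m: 1 keeps a pixel, 0 drops it (a centered square hole for simplicity).
    m = torch.ones(batch, 1, height, width)
    top, left = (height - hole) // 2, (width - hole) // 2
    m[:, :, top:top + hole, left:left + hole] = 0.0
    return m

def inpaint(generator, x, z):
    # x: real images of shape (batch, channels, H, W); z: noise vector.
    m = sample_mask(x.size(0), x.size(2), x.size(3))
    x_g = generator(m * x, z)         # x_G = G(m ⊙ x, z): fill the hole from the masked context
    x_i = (1.0 - m) * x_g + m * x     # x_I = (1 − m) ⊙ x_G + m ⊙ x
    return x_i

# The discriminator separates real unlabeled images from x_i, and its penultimate
# features are shared with the cross-entropy classifier head.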
Improved GAN. There are several methods to adapt
GANs into a semi-supervised classification scenario. Cat-
GAN [140] forces the discriminator to maximize the mutual
information between examples and their predicted class dis-
tributions instead of training the discriminator to learn a bi-
nary classification. To overcome the learned representations’
bottleneck of CatGAN, Semi-supervised GAN (SGAN) [142]
learns a generator and a classifier simultaneously. The clas-
sifier network can have (K + 1) output units corresponding to [y_1, y_2, . . . , y_K, y_{K+1}], where y_{K+1} represents the outputs generated by G. Similar to SGAN, Improved GAN
[143] solves a (K +1)-class classification problem. The struc-
ture of Improved GAN is shown in Fig. 2(5). Real examples are assigned to one of the first K classes, while the additional (K + 1)-th class consists of the synthetic images produced by the generator G.
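A rough sketch of this (K + 1)-class objective is given below, assuming classifier(x) returns K + 1 logits with the last index reserved for generated samples; the helper names are hypothetical and the formulation is a simplified rendering of the SGAN/Improved GAN setup rather than the authors' exact code.

import torch
import torch.nn.functional as F

def k_plus_one_d_loss(classifier, generator, x_lab, y_lab, x_unlab, z, K):
    fake = generator(z).detach()
    # Labeled real data: standard cross-entropy over the first K classes.
    loss_sup = F.cross_entropy(classifier(x_lab), y_lab)
    # Probability assigned to the fake class (index K) for real and generated inputs.
    p_fake_real = F.softmax(classifier(x_unlab), dim=1)[:, K]
    p_fake_gen = F.softmax(classifier(fake), dim=1)[:, K]
    # Unlabeled real samples should avoid class K; generated samples should fall into it.
    loss_unsup = -torch.log(1.0 - p_fake_real + 1e-8).mean() \
                 - torch.log(p_fake_gen + 1e-8).mean()
    return loss_sup + loss_unsup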
This work also proposes improved techniques to train GANs, i.e., feature matching, minibatch discrimination, historical averaging, one-sided label smoothing, and virtual batch normalization, where feature matching is