利用未标记数据进行班级增量学习：LACU框架与方法

178 浏览量更新于2024-08-26 收藏 1.96MB PDF 举报

"这篇研究论文提出了一种名为LACU（Learning with Augmented Class by Exploiting Unlabeled Data）的框架，旨在解决类增量学习（Class-Incremental Learning，C-IL）中的挑战。在C-IL问题中，学习系统需要处理未见过的新类别数据，而这些数据在训练阶段并未提供。为了应对这一挑战，作者提出了利用未标记数据来辅助学习过程，以提升系统对新类别和已知类别识别的准确性。LACU框架结合了LACU-SVM方法，通过分析未标记数据中的结构信息，降低不同类别间的错误分类风险。实验证明，这种方法在多个数据集上表现出良好的效果。" 在开放和渐变的学习环境中，学习系统的适应性和检测变化的能力至关重要。类增量学习（C-IL）就是针对这种环境设计的一种学习策略，它关注如何处理新出现的、训练阶段未曾遇到的类别数据。C-IL的主要难题在于，系统需要正确区分新类别和已知类别，但缺乏新类别的样本来进行训练。 LACU框架是为了解决这个问题而提出的创新解决方案。该框架的核心思想是利用未标记数据来帮助学习过程。由于未标记数据在很多实际应用中容易获取，因此可以作为一种宝贵的资源。LACU框架结合了未标记数据中的结构信息，这有助于学习系统理解不同类别的边界和特征，从而减少错误分类的风险。具体来说，LACU-SVM方法是LACU框架的一部分，它利用支持向量机（SVM）的分类能力，同时考虑未标记数据的特性。SVM通常依赖于标记数据，但在LACU-SVM中，未标记数据被纳入到训练过程中，用于改进分类器对已知类别和新类别边界的估计。通过这种方式，即使在没有新类别实例的情况下，系统也能更好地适应和预测新类别。实验部分，研究者在多个数据集上测试了LACU方法的有效性，结果证明了利用未标记数据能有效提高类别增量学习的性能。这表明，LACU框架和LACU-SVM方法为处理不断变化的环境提供了新的视角，对于未来的类增量学习研究具有重要的启示意义。总结起来，这篇研究论文通过提出LACU框架和LACU-SVM方法，展示了未标记数据在类增量学习中的潜力，为处理开放环境中的学习问题提供了新的工具和理论基础。这一工作对于理解如何在数据不完全的情况下实现有效的学习算法，特别是在需要持续适应新信息的系统中，具有深远的实践价值。

Learning with Augmented Class by Exploiting Unlabeled Data

⇤

Qing Da Yang Yu Zhi-Hua Zhou

National Key Laboratory for Novel Software Technology

Nanjing University, Nanjing 210023, China

{daq, yuy, zhouzh}@lamda.nju.edu.cn

Abstract

In many real-world applications of learning, the envi-

ronment is open and changes gradually, which requires

the learning system to have the ability of detecting and

adapting to the changes. Class-incremental learning (C-

IL) is an important and practical problem where data

from unseen augmented classes are fed, but has not been

studied well in the past. In C-IL, the system should be-

ware of predicting instances from augmented classes as

a seen class, and thus faces the challenge that no such

instances were observed during training stage. In this

paper, we tackle the challenge by using unlabeled data,

which can be cheaply collected in many real-world ap-

plications. We propose the LACU framework as well

as the LACU-SVM approach to learn the concept of

seen classes while incorporating the structure presented

in the unlabeled data, so that the misclassiﬁcation risks

among the seen classes as well as between the aug-

mented and the seen classes are minimized simultane-

ously. Experiments on diverse datasets show the effec-

tiveness of the proposed approach.

Introduction

Traditional machine learning approaches face many chal-

lenges raised in real-world applications, where the open

and dynamic environments break the stationary settings im-

plied in traditional approaches. A branch of methods dealing

with the changing environments is the incremental learn-

ing, which mainly includes sub-branches of the example-

incremental learning (E-IL) (Ruping 2001; Polikar et al.

2001; Fern and Givan 2003), the attribute-incremental learn-

ing (A-IL) (Vapnik, Vashist, and Pavlovitch 2009), the class-

incremental learning (C-IL) (Fink et al. 2006; Muhlbaier,

Topalis, and Polikar 2009; Kuzborskij, Orabona, and Caputo

2013) as concluded in (Zhou and Chen 2002). Among them,

C-IL is an important problem which is often encountered

in practice. For example, in building an image classiﬁcation

system for pictures in the Internet, the user may only label a

few classes, say the dog, ﬁsh and bird. However, the system

has to predict images from wide classes in the future. When

⇤

This research was supported by the 973 Program

(2014CB340501), NSFC (61333014, 61375061) and Jiang-

suSF (BK2012303).

 2014, Association for the Advancement of Artiﬁcial

fish

dog

bird

Figure 1: An illustration that unlabeled data helps the learn-

ing with augmented class problem.

an image of tiger comes, a traditional classiﬁcation algo-

rithm will predict it in seen classes, like dog, which could

make the system unusable.

This paper investigates one of the core problems in C-

IL, i.e., how to recognize instances from unseen augmented

classes. An augmented class is a class which is unknown

during the training stage, but appears in the test stage. Once

the system can tell the augmented classes from the seen

ones, latter processing of the augmented classes can be han-

dled. Therefore, we would like the system to report an ex-

tra option to denote that an instance is from the augmented

class, with a high accuracy.

Speciﬁcally, the learning with augmented class (LAC)

problem, is given a training dataset D = {(x

)}

i=1

where x

2 R

is an training instance and y

2 Y =

{1, 2,...,K} is the associated class label. Unlike the canoni-

cal classiﬁcation, during test, we need to predict the class of

the instances from an open dataset D

= {x

}

i=1

, where

2 Y

= {1, 2,...,K,K +1,...,M} with M>K. As

there are classes unobservable during the training time, the

goal of learning with augmented class is to learn a model

f(x):X ! Y

= {1, 2,...,K,novel}, where the option

novel indicates that x belongs to the augmented class, in

order to minimize following expected risk

⇤

= argmin

f2H

(x,y)⇠ D

err(y, f (x)), (1)

where H is a hypnosis space and err is LAC error

err(y, f (x)) =

⇢

I(f(x) 6= y),y2 Y

I(f(x) 6= novel),y/2 Y

(2)

Here I(expression) is an indicator function which equals 1

when the expression is true and 0 otherwise.

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence

1760

下载后可阅读完整内容，剩余6页未读，立即下载

weixin_38690545

粉丝: 4
资源: 927

利用未标记数据进行班级增量学习：LACU框架与方法

基于多视图未标记数据的机器学习.pdf

无标记数据学习, 一致性学习与自监督学习是什么？【Google AI-Luong, 83ppt】.zip

利用未标记数据：半监督学习详解

利用未标记数据提升班级增量学习的框架

利用未标记数据：半监督学习与协同训练解析

YOLOv3图像分类弱监督学习秘籍：利用未标记数据提升模型性能，降低数据标注成本

双向主动学习：对未标记和标记数据集的双向探索

使用未标记的数据进行存储适合的学习

通过神经网络从未标记的数据中转移学习

利用未标记数据提取特权信息提升分类器性能

最新资源