TABLE 1
Illustrations of three LNRL examples based on Definition 2.2.

T | E = (x, y) | P
web-scale image classification | (ImageNet, crowdsourced labels) | test accuracy
intelligent healthcare | (medical data, annotations with variability) | error rate
VoIP speech analysis | (perceived speech, user feedback) | voice quality rate
which is different from LNRL, where the labeled data are still noisy to some degree. To address SSL, there are several typical algorithms, such as [27], [48], [49], [50].
• Positive-unlabeled Learning (PUL) [51] learns the hypothesis $f$ from experience $E$ consisting of only positively labeled and unlabeled data. Similar to SSL, the unlabeled data are normally annotated with pseudo-labels. However, PUL assumes that the labeled data are fully clean and only positive. To address PUL, there are several typical algorithms, such as [52], [53], [54].
• Complementary Learning (CL) [55] specifies a class that a pattern does NOT belong to. Namely, CL learns the hypothesis $f$ from experience $E$ consisting of only complementarily labeled data. Since the labeling process cannot fully exclude uncertainty about which category a pattern belongs to, CL is related to LNRL. However, CL requires that all diagonal entries of the transition matrix are zero (a minimal sketch of such a matrix follows this list). Moreover, in practice, the transition matrix may not need to be invertible. To address CL, there are several typical algorithms, such as [55], [56], [57], [58].
• Unlabeled-unlabeled Learning (UUL) [59] is a recently proposed learning paradigm that allows us to train a binary classifier from only two unlabeled datasets with different class priors. Different from SSL and PUL, UUL uses two sets of unlabeled data instead of one. To address UUL, there are two typical algorithms, namely [59] and [60].
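To make the transition-matrix constraint in CL concrete, below is a minimal sketch (our own illustration, not code from [55], [56], [57], [58]) of the uniform complementary-label transition matrix; the helper name uniform_complementary_T is ours:

```python
import numpy as np

def uniform_complementary_T(num_classes: int) -> np.ndarray:
    """Uniform complementary-label transition matrix.

    T[i, j] = P(observed complementary label = j | true label = i).
    The diagonal is all zeros, since a complementary label never
    names the true class; the off-diagonal mass is uniform.
    """
    T = np.full((num_classes, num_classes), 1.0 / (num_classes - 1))
    np.fill_diagonal(T, 0.0)
    return T

T = uniform_complementary_T(10)
assert np.allclose(np.diag(T), 0.0)     # zero diagonal, as CL requires
assert np.allclose(T.sum(axis=1), 1.0)  # each row is a distribution
```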
3.4 Core Issue
When machine learning operates in an ideal environment, the data come with clean supervision. Therefore, the $\ell$-risk under the clean distribution is as follows:
$$ R_{\ell, D}(f_{\theta^*}) := \mathbb{E}_{(X,Y)\sim D}\left[\ell(f_{\theta^*}(X), Y)\right], \quad (1) $$
where $(X, Y)$ is a clean example drawn i.i.d. from the clean distribution $D$, $f_\theta$ is a learning model (e.g., a deep neural network) parameterized by $\theta$, and $\ell$ is normally the cross-entropy loss. In this survey, we consider the classification problem.
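In code, the $\ell$-risk in (1) is approximated by averaging the cross-entropy over a finite clean sample. The following is a minimal PyTorch sketch (our own illustration: f_theta stands for any classifier returning logits, and loader for an iterator over clean pairs):

```python
import torch
import torch.nn.functional as F

def empirical_clean_risk(f_theta, loader, device="cpu"):
    """Monte-Carlo estimate of the clean l-risk in Eq. (1): the average
    cross-entropy of f_theta over clean pairs (X, Y)."""
    total, n = 0.0, 0
    f_theta.eval()
    with torch.no_grad():
        for X, Y in loader:
            X, Y = X.to(device), Y.to(device)
            # l(f_theta(X), Y) with l = cross-entropy, summed over the batch
            total += F.cross_entropy(f_theta(X), Y, reduction="sum").item()
            n += Y.size(0)
    return total / n  # approximates E[l(f_theta(X), Y)] as n grows
```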
However, when machine learning operates in a real-world environment, the data come with noisy supervision. Namely, the $\ell$-risk under the noisy distribution is $\mathbb{E}_{(X,\bar{Y})\sim \bar{D}}[\ell(f_\theta(X), \bar{Y})]$. Furthermore, with limited data, the empirical $\tilde{\ell}$-risk under the noisy distribution is as follows:
$$ \widehat{R}_{\tilde{\ell}, \bar{D}}(f_\theta) := \frac{1}{n}\sum_{i=1}^{n} \tilde{\ell}(f_\theta(X_i), \bar{Y}_i), \quad (2) $$
where $(X_i, \bar{Y}_i)$ is an (observed) noisy example drawn i.i.d. from the noisy distribution $\bar{D}$ (with noise rate $\rho$). Note that $\tilde{\ell}$ is a suitably modified loss, which is noise-tolerant. Here, we empirically demonstrate the generalization difference between $\ell$ and $\tilde{\ell}$ under label noise (Figure 1).

Fig. 1. We empirically demonstrate the generalization difference between the original $\ell$ and the corrected $\tilde{\ell}$. We choose MNIST with 35% uniform noise as the noisy data. There is an obvious gap between $\ell$ and $\tilde{\ell}$ on noisy MNIST.
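As one concrete way to build such a noise-tolerant $\tilde{\ell}$ (shown purely for illustration and assuming a known or estimated transition matrix $T$ with $T_{ij} = P(\bar{Y}=j \mid Y=i)$), a forward-corrected loss can be sketched in PyTorch as follows:

```python
import torch
import torch.nn.functional as F

def forward_corrected_loss(logits, noisy_labels, T):
    """Noise-tolerant loss l~ via forward correction:
    l~(f(x), y_bar) = -log((T^T p)_{y_bar}), with p = softmax(f(x)).
    T[i, j] = P(noisy label j | clean label i), assumed known or estimated.
    """
    p = F.softmax(logits, dim=1)   # estimate of the clean class posterior
    p_noisy = p @ T                # predicted noisy posterior, T^T p per row
    return F.nll_loss(torch.log(p_noisy + 1e-12), noisy_labels)
```

Here the softmax output plays the role of the clean class posterior, while the loss is evaluated against the noisy label through $T$; when $T$ is correctly specified, the corrected risk on $\bar{D}$ is designed to align with the original risk on $D$.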
Generally, the aim of LNRL is to construct a noise-tolerant $\tilde{\ell}$ such that the $f_\theta$ learned via (2) approximates the optimal $f_{\theta^*}$ in (1) well. Specifically, via a suitably constructed $\tilde{\ell}$, we can learn a robust deep classifier $f_\theta$ from the noisy training examples that assigns clean labels to test instances. Before delving into constructing $\tilde{\ell}$, we first take a theoretical look at label-noise learning, which will help us build $\tilde{\ell}$ more effectively.
3.5 Theoretical Understanding
In contrast to [9], [10], we provide a systematic way to understand LNRL through the lens of learning theory. Our focus is to explore why noisy labels affect the performance of deep models. To figure this out, we should rethink the essence of learning with noisy labels. Normally, there are three key ingredients in label-noise learning problems: the input data, the objective function, and the optimization policy.
At a high level, there are three rules of thumb, which explain how to handle noisy labels effectively via deep models.
• For data, the key is to discover the underlying noise-transition pattern, which directly links the clean class posterior and the noisy class posterior. Based on this insight, it is critical to design an unbiased estimator that estimates the noise transition matrix $T$ accurately.
• For the objective function, the key is to design a noise-tolerant $\tilde{\ell}$ that enjoys statistical consistency guarantees. Based on this insight, it is critical to learn a robust classifier on noisy data that provably converges to the classifier learned on clean data.
• For the optimization policy, the key is to explore the dynamic process of optimization, which relates to memorization. Based on this insight, it is critical to trade off overfitting and underfitting when training deep networks, e.g., via early stopping and small-loss tricks; a minimal sketch of the small-loss trick follows this list.
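To make the small-loss trick above concrete, here is a minimal per-mini-batch sketch (in the spirit of small-loss sample selection; keep_ratio and its annealing schedule are illustrative assumptions, not a prescription from any single cited work):

```python
import torch
import torch.nn.functional as F

def small_loss_selection(logits, noisy_labels, keep_ratio):
    """Keep the keep_ratio fraction of a mini-batch with the smallest losses.

    Deep networks tend to fit clean labels before noisy ones (memorization),
    so small-loss examples are more likely to be clean. keep_ratio is often
    annealed from 1 toward 1 - rho, where rho is the estimated noise rate.
    """
    losses = F.cross_entropy(logits, noisy_labels, reduction="none")
    k = max(1, int(keep_ratio * losses.numel()))
    idx = torch.argsort(losses)[:k]  # indices of the small-loss examples
    return losses[idx].mean(), idx   # loss to back-propagate, and selection
```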