Thrun and Mitchell [55] look at solving Boolean classification tasks in a
lifelong-learning framework, where an agent encounters a collection of related
problems over its lifetime. They learn each new task with a neural network, but
they enhance the standard gradient-descent algorithm with slope information
acquired from previous tasks. This speeds up the search for network parameters
in a target task and biases it towards the parameters for previous tasks.
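As a rough illustration of the flavor of this approach (not Thrun and Mitchell's actual algorithm), the Python sketch below fits a toy one-dimensional model to target-task data with ordinary gradient descent plus an extra term that pulls the model's slope toward slope estimates assumed to have been acquired from earlier tasks; the data, the model, and the slope_src function are all placeholders.

```python
import numpy as np

# Sketch only: slope-guided gradient descent on a toy problem.
# slope_src stands in for slope information acquired from previous tasks.
def slope_src(x):
    return np.cos(x)

# A small target-task sample.
rng = np.random.default_rng(0)
x = np.linspace(0.0, 3.0, 10)
y = np.sin(x) + 0.1 * rng.normal(size=10)

# Toy model f(x) = a*x + b*x**2, a stand-in for a neural network.
theta = np.zeros(2)
lam, lr = 0.5, 0.01   # weight on the slope-matching term, learning rate

def f(theta, x):
    return theta[0] * x + theta[1] * x ** 2

def df_dx(theta, x):
    return theta[0] + 2 * theta[1] * x

for _ in range(2000):
    # Gradient of the usual squared error ...
    err = f(theta, x) - y
    grad = np.array([np.mean(2 * err * x), np.mean(2 * err * x ** 2)])
    # ... plus the gradient of a penalty on disagreeing with the source slopes.
    slope_err = df_dx(theta, x) - slope_src(x)
    grad += lam * np.array([np.mean(2 * slope_err),
                            np.mean(2 * slope_err * 2 * x)])
    theta -= lr * grad
```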
Mihalkova and Mooney [27] perform transfer between Markov Logic Networks (MLNs). Given a learned MLN for a source task, they learn an MLN for a related target task by starting with the source-task one and diagnosing each formula, adjusting ones that are too general or too specific in the target domain. The
hypothesis space for the target task is therefore defined in relation to the source-
task MLN by the operators that generalize or specialize formulas.
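A schematic sketch of such a diagnose-and-revise loop is shown below; the formula representation, the generalize/specialize operators, and the scoring function are all hypothetical placeholders rather than Mihalkova and Mooney's actual procedure.

```python
# Sketch only: revise each source-task formula by keeping whichever of the
# {original, generalized, specialized} variants scores best on target data.

def generalize(formula):
    # Placeholder: e.g. replace a constant with a variable or drop a literal.
    return formula + " [generalized]"

def specialize(formula):
    # Placeholder: e.g. add a literal that restricts the formula.
    return formula + " [specialized]"

def score(formulas, target_data):
    # Placeholder for a real fit measure such as pseudo-likelihood of the
    # target data under the candidate MLN.
    return -0.01 * sum(len(f) for f in formulas)

def revise(source_formulas, target_data):
    revised = []
    for formula in source_formulas:
        candidates = [formula, generalize(formula), specialize(formula)]
        best = max(candidates, key=lambda c: score(revised + [c], target_data))
        revised.append(best)
    return revised
```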
Hlynsson [17] phrases transfer learning in classification as a minimum description length problem given source-task hypotheses and target-task data. That is, the chosen hypothesis for a new task can use hypotheses for old tasks but stipulate exceptions for some data points in the new task. This method aims for a tradeoff between accuracy and compactness in the new hypothesis.
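The trade-off can be made concrete with a small sketch: using rough, assumed bit costs, reusing a source hypothesis and paying only for its exceptions on the target data is compared against describing a fresh hypothesis.

```python
import math

# Sketch only: toy description-length accounting with made-up costs.

def hypothesis_cost(num_params, bits_per_param=32):
    # Bits to describe the new parts of the hypothesis itself.
    return num_params * bits_per_param

def exception_cost(num_errors, num_examples):
    # Bits to point out which target examples are exceptions, plus their labels.
    if num_errors == 0:
        return 0.0
    return num_errors * math.log2(num_examples) + num_errors

def description_length(num_params, num_errors, num_examples):
    return hypothesis_cost(num_params) + exception_cost(num_errors, num_examples)

# Reuse the source hypothesis (no new parameters, 12 exceptions) versus
# fitting a fresh 20-parameter hypothesis with 2 exceptions, on 1000 examples.
print(description_length(0, 12, 1000))   # ~131.6 bits
print(description_length(20, 2, 1000))   # ~661.9 bits
```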
Ben-David and Schuller [3] propose a transformation framework to determine
how related two Boolean classification tasks are. They define two tasks as related
with respect to a class of transformations if they are equivalent under that class;
that is, if a series of transformations can make one task look exactly like the
other. They provide conditions under which learning related tasks concurrently
requires fewer examples than single-task learning.
Bayesian Transfer
One area of inductive transfer applies specifically to Bayesian learning methods. Bayesian learning involves modeling probability distributions and taking advantage of conditional independence among variables to simplify the model.
An additional aspect that Bayesian models often have is a prior distribution,
which describes the assumptions one can make about a domain before seeing
any training data. Given the data, a Bayesian model makes predictions by combining it with the prior distribution to produce a posterior distribution. A strong
prior can significantly affect these results (see Figure 5). This serves as a natural
way for Bayesian learning methods to incorporate prior knowledge: in the case of transfer learning, source-task knowledge.
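The effect of a strong prior can be seen in a minimal worked example using a conjugate Beta-Bernoulli model (chosen here purely for illustration; the prior pseudo-counts stand in for source-task knowledge).

```python
# Sketch only: Beta(alpha, beta) prior + Bernoulli data -> Beta posterior.

def posterior_mean(alpha, beta, heads, tails):
    return (alpha + heads) / (alpha + beta + heads + tails)

heads, tails = 3, 7                          # small target-task sample

print(posterior_mean(1, 1, heads, tails))    # weak prior: ~0.33, close to the data
print(posterior_mean(80, 20, heads, tails))  # strong prior: ~0.75, pulled toward 0.8
```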
Marx et al. [24] use a Bayesian transfer method for tasks solved by a logistic
regression classifier. The usual prior for this classifier is a Gaussian distribution
with a mean and variance set through cross-validation. To perform transfer, they
instead estimate the mean and variance by averaging over several source tasks.
Raina et al. [33] use a similar approach for multi-class classification by learning
a multivariate Gaussian prior from several source tasks.
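A minimal sketch of this style of transfer is given below, assuming weight vectors already learned on several source tasks; the synthetic data, the simple gradient loop, and the diagonal-variance prior are illustrative assumptions, not the exact procedures of Marx et al. or Raina et al.

```python
import numpy as np

# Sketch only: logistic regression whose Gaussian prior (mean and variance)
# is estimated from weights learned on source tasks.

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit(X, y, prior_mean, prior_var, lr=0.1, steps=500):
    w = prior_mean.copy()
    for _ in range(steps):
        p = sigmoid(X @ w)
        grad = X.T @ (p - y) / len(y)          # negative log-likelihood term
        grad += (w - prior_mean) / prior_var   # Gaussian prior pulls w toward its mean
        w -= lr * grad
    return w

rng = np.random.default_rng(0)
d = 5

# Pretend these weight vectors were learned on four source tasks.
source_weights = rng.normal(size=(4, d))
prior_mean = source_weights.mean(axis=0)
prior_var = source_weights.var(axis=0) + 1e-3

# A small synthetic target-task sample.
X = rng.normal(size=(20, d))
y = (X @ prior_mean + 0.5 * rng.normal(size=20) > 0).astype(float)

w_target = fit(X, y, prior_mean, prior_var)
```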
Dai et al. [7] apply a Bayesian transfer method to a Naive Bayes classifier.
They set the initial probability parameters based on a single source task, and
revise them using target-task data. They also provide some theoretical bounds
on the prediction error and convergence rate of their algorithm.
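The flavor of this approach can be sketched with word-count parameters: source-task counts set the initial estimates, and target-task counts revise them (the interpolation weight below is a placeholder, and no claim is made about the bounds in the paper).

```python
from collections import Counter

# Sketch only: Naive Bayes word probabilities initialized from a source task
# and revised with (sparse) target-task counts via simple interpolation.

def word_probs(source_counts, target_counts, vocab, source_weight=0.5):
    total_src = sum(source_counts.values())
    total_tgt = sum(target_counts.values())
    probs = {}
    for w in vocab:
        p_src = (source_counts[w] + 1) / (total_src + len(vocab))  # Laplace-smoothed
        p_tgt = (target_counts[w] + 1) / (total_tgt + len(vocab))
        probs[w] = source_weight * p_src + (1 - source_weight) * p_tgt
    return probs

vocab = {"goal", "ball", "vote", "party"}
source_counts = Counter({"goal": 40, "ball": 35, "vote": 2, "party": 3})
target_counts = Counter({"goal": 3, "ball": 1, "vote": 1})
print(word_probs(source_counts, target_counts, vocab))
```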