Support Vector Regression Tutorial: Algorithms and Applications

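"This tutorial introduces support vector regression (SVR). It was written by Alex J. Smola and Bernhard Schölkopf and is part of the NeuroCOLT technical report series. It covers the basic concepts of SVR, the training algorithms, advanced methods for large data sets, and a number of modifications and extensions of SVR."

Support vector machines (SVMs) were first proposed as a classification tool and were later extended to support vector regression for predicting continuous variables. SVR looks for a function that is as flat as possible while deviating from the observed targets by at most ε on the training data. The training points that determine the solution are called support vectors. Deviations smaller than ε are ignored by the ε-insensitive loss function, which defines a tube of width ε (the ε-tube) around the regression function; only points outside the tube are penalized.

The tutorial then explains how SVR is trained. Training is posed as a convex quadratic programming (QP) problem. The hard-constraint form requires every training point to lie inside the ε-tube, while the soft-margin form introduces slack variables so that some points may fall outside it. For nonlinear problems, the kernel trick (for example Gaussian RBF, polynomial, or sigmoid kernels) implicitly maps the data into a high-dimensional feature space in which a linear function can fit them.

For large data sets, general-purpose QP solvers run into computational and memory limits. The description therefore points to more efficient algorithms such as Sequential Minimal Optimization (SMO) and cutting-plane methods, which can handle large amounts of training data while keeping good performance.

Finally, the authors discuss modifications and extensions of SVR, such as regularization terms that control model complexity and prevent overfitting, and online learning algorithms for incremental training on data streams. They may also touch on applications such as multi-task learning and anomaly detection, and on combining SVR with other machine learning methods. The tutorial is a valuable reference for the core principles of support vector regression and its optimization strategies, and suits readers interested in machine learning and statistical modeling.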

Cost Functions 10

instance when dealing with few data in very high-dimensional spaces, this may not be a good idea, as it will lead to overfitting and thus bad generalization properties. Hence one should add a capacity control term, which in the SV case is $\|w\|^2$. This leads to the regularized risk functional [Tikhonov and Arsenin, 1977, Morozov, 1984, Vapnik, 1982]

$$R_{\mathrm{reg}}[f] := R_{\mathrm{emp}}[f] + \frac{\lambda}{2}\|w\|^2 \qquad (33)$$

where $\lambda > 0$ is a so-called regularization constant. Many algorithms like regularization networks [Girosi et al., 1993] or weight decay networks [Bishop, 1995] minimize an expression similar to (33).
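The regularized risk in (33) can be sketched numerically. The snippet below is only an illustration under stated assumptions: it takes the empirical risk to be the sample average of a cost function (its definition sits on a page not included in this excerpt), it uses a linear model f(x) = <w, x> + b, and it takes the capacity term to be (λ/2)‖w‖². The names `empirical_risk`, `regularized_risk` and `abs_cost` are hypothetical helpers, not part of the tutorial.

```python
def empirical_risk(cost, f, xs, ys):
    """Sample average of cost(x, y, f(x)); assumed form of R_emp[f]."""
    return sum(cost(x, y, f(x)) for x, y in zip(xs, ys)) / len(xs)


def regularized_risk(cost, w, b, xs, ys, lam):
    """R_emp[f] + (lam / 2) * ||w||^2 for the linear model f(x) = <w, x> + b."""
    def f(x):
        return sum(wi * xi for wi, xi in zip(w, x)) + b

    capacity_term = 0.5 * lam * sum(wi * wi for wi in w)
    return empirical_risk(cost, f, xs, ys) + capacity_term


def abs_cost(x, y, fx):
    """Absolute deviation, used here only as a placeholder cost."""
    return abs(y - fx)


# Toy data lying exactly on the line y = x (illustrative values only).
xs = [(0.0,), (1.0,), (2.0,)]
ys = [0.0, 1.0, 2.0]

perfect_fit = regularized_risk(abs_cost, (1.0,), 0.0, xs, ys, lam=0.1)
flat_fit = regularized_risk(abs_cost, (0.0,), 0.0, xs, ys, lam=0.1)
```

In this toy case the perfect fit pays only the capacity term, while the flat function pays only the empirical term, which is the trade-off that λ controls.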

3.2 Maximum Likelihood and Density Models

Now the question arises which cost function $c(x, y, f(x))$ should be used in (33). The standard setting in the SV case is, as already mentioned in section 1.2,

$$c(x, y, f(x)) = |y - f(x)|_\varepsilon \qquad (34)$$

It is straightforward to show that minimizing (33) with the particular loss function of (34) is equivalent to minimizing (3), the only difference being that $C = 1/(\lambda \ell)$.

Loss functions such as $|y - f(x)|_\varepsilon^p$ with $p > 1$ may not be desirable, as the superlinear increase leads to a loss of the robustness properties of the estimator (see e.g. [Huber, 1981]): in those cases the derivative of the cost function may grow without bound. For $p < 1$ the loss function becomes nonconvex.

For the case of $c(x, y, f(x)) = (y - f(x))^2$ we recover the least mean squares fit approach, which, unlike the standard SV loss function, leads to a matrix inversion instead of a quadratic programming problem.
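The equivalence between the regularized risk and the SV objective can be checked numerically. The sketch below rests on assumptions that this excerpt does not show: it takes (3) to be the usual SV objective (1/2)‖w‖² + C Σ |y_i − f(x_i)|_ε, with the slack variables already eliminated, and it takes the empirical risk to be the sample average of the ε-insensitive loss. Dividing the regularized risk by λ then reproduces the SV objective with C = 1/(λℓ). All function names are hypothetical, and the model is one-dimensional to keep the example short.

```python
def eps_insensitive(y, fx, eps):
    """epsilon-insensitive loss |y - fx|_eps = max(0, |y - fx| - eps)."""
    return max(0.0, abs(y - fx) - eps)


def regularized_risk(w, b, xs, ys, lam, eps):
    """(1 / l) * sum of losses + (lam / 2) * w^2 for f(x) = w * x + b."""
    losses = [eps_insensitive(y, w * x + b, eps) for x, y in zip(xs, ys)]
    return sum(losses) / len(xs) + 0.5 * lam * w * w


def sv_objective(w, b, xs, ys, C, eps):
    """Assumed form of (3): (1 / 2) * w^2 + C * sum of losses."""
    losses = [eps_insensitive(y, w * x + b, eps) for x, y in zip(xs, ys)]
    return 0.5 * w * w + C * sum(losses)


# Illustrative data and constants (not taken from the tutorial).
xs = [0.0, 1.0, 2.0, 3.0]
ys = [0.1, 0.7, 2.6, 2.9]
lam, eps = 0.5, 0.2
C = 1.0 / (lam * len(xs))

# For any candidate (w, b) the two objectives differ only by the factor lam,
# so they share the same minimizer.
candidates = [(0.0, 0.0), (1.0, 0.0), (0.8, 0.3), (-0.5, 2.0)]
gaps = [
    abs(sv_objective(w, b, xs, ys, C, eps)
        - regularized_risk(w, b, xs, ys, lam, eps) / lam)
    for w, b in candidates
]
```

Because the two objectives are proportional for every candidate, choosing λ and choosing C are two parameterizations of the same trade-off.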

The question that now arises is which cost function should be used in (33). On the one hand we will want to avoid using a very complicated function $c$, as this may lead to difficult optimization problems. On the other hand one should use the particular cost function that suits the data best. For instance, we may be given a cost function $\tilde{c}$ by some real-world problem, hence we should use this particular one. Moreover, under the assumption that the samples were generated by an underlying functional dependency plus additive noise, i.e. $y_i = f_{\mathrm{true}}(x_i) + \xi_i$ with density $p(\xi)$, the optimal cost function in a maximum likelihood sense would be

$$c(x, y, f(x)) = -\log p(y - f(x)) \qquad (35)$$

This can be seen as follows. The likelihood of an estimate

$$X_f := \{(x_1, f(x_1)), \ldots, (x_\ell, f(x_\ell))\} \qquad (36)$$

See Smola, 1998] for a discussion of other regularization terms and invariance properties

of quadratic regularization functionals.


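$$
\begin{array}{lll}
 & \text{loss function } c(\xi) & \text{density model } p(\xi) \\
\varepsilon\text{-insensitive} & |\xi|_\varepsilon & \frac{1}{2(1+\varepsilon)} \exp(-|\xi|_\varepsilon) \\
\text{Laplacian} & |\xi| & \frac{1}{2} \exp(-|\xi|) \\
\text{Gaussian} & \frac{1}{2}\xi^2 & \frac{1}{\sqrt{2\pi}} \exp\left(-\frac{\xi^2}{2}\right) \\
\text{Huber's robust loss} &
  \begin{cases} \frac{1}{2\sigma}\xi^2 & \text{if } |\xi| \le \sigma \\ |\xi| - \frac{\sigma}{2} & \text{otherwise} \end{cases} &
  \propto \begin{cases} \exp\left(-\frac{\xi^2}{2\sigma}\right) & \text{if } |\xi| \le \sigma \\ \exp\left(\frac{\sigma}{2} - |\xi|\right) & \text{otherwise} \end{cases} \\
\text{Polynomial} & \frac{1}{p}|\xi|^p & \frac{p}{2\Gamma(1/p)} \exp(-|\xi|^p) \\
\text{Piecewise polynomial} &
  \begin{cases} \frac{1}{p\sigma^{p-1}}\xi^p & \text{if } |\xi| \le \sigma \\ |\xi| - \sigma\frac{p-1}{p} & \text{otherwise} \end{cases} &
  \propto \begin{cases} \exp\left(-\frac{\xi^p}{p\sigma^{p-1}}\right) & \text{if } |\xi| \le \sigma \\ \exp\left(\sigma\frac{p-1}{p} - |\xi|\right) & \text{otherwise} \end{cases}
\end{array}
$$

Table 1

Common loss functions and corresponding density models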

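under the assumption of additive noise and iid data is

$$p(X_f \mid X) = \prod_{i=1}^{\ell} p(f(x_i) \mid (x_i, y_i)) = \prod_{i=1}^{\ell} p(y_i - f(x_i)) \qquad (37)$$

Maximizing $p(X_f \mid X)$ is equivalent to minimizing $-\log p(X_f \mid X)$. By using (35) we get

$$-\log p(X_f \mid X) = \sum_{i=1}^{\ell} c(x_i, y_i, f(x_i)) \qquad (38)$$

which proves the statement.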
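The argument can be illustrated with a small numerical check. The sketch below assumes Laplacian noise with density p(ξ) = (1/2) exp(−|ξ|), builds the cost −log p(y − f(x)) from it, and compares the negative log-likelihood of a candidate function with the summed cost. The data, the candidate function `f` and all helper names are made up for illustration.

```python
import math


def laplace_density(xi):
    """Laplacian noise density p(xi) = 0.5 * exp(-|xi|)."""
    return 0.5 * math.exp(-abs(xi))


def cost_from_density(density):
    """Cost c(x, y, f(x)) = -log p(y - f(x)) induced by a noise density."""
    return lambda x, y, fx: -math.log(density(y - fx))


def likelihood(density, f, xs, ys):
    """Product of p(y_i - f(x_i)) over the sample (iid, additive noise)."""
    return math.prod(density(y - f(x)) for x, y in zip(xs, ys))


def f(x):
    """A candidate estimate, chosen arbitrarily for the example."""
    return 2.0 * x


xs = [0.0, 1.0, 2.0]
ys = [0.5, 1.5, 4.0]  # residuals y - f(x) are 0.5, -0.5 and 0.0

cost = cost_from_density(laplace_density)
total_cost = sum(cost(x, y, f(x)) for x, y in zip(xs, ys))
neg_log_likelihood = -math.log(likelihood(laplace_density, f, xs, ys))
```

For this density the induced cost is |ξ| + log 2, so minimizing it is the same as minimizing the absolute deviation; the constant log 2 does not affect the minimizer.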

However, the cost function resulting from this reasoning might be nonconvex. In this case one would have to find a convex proxy in order to deal with the situation efficiently (i.e. to find an efficient implementation of the corresponding optimization problem). Moreover, the situation of regression as such, i.e. without any knowledge of cost functions, is not properly defined from the viewpoint of structural risk minimization: risk can only be minimized if it can be quantified via a cost function (i.e. a penalty for deviations). Finally, given a specific cost function from a real-world problem, one should try to find as close a proxy to this cost function as possible, as it is the performance with respect to this particular cost function that matters ultimately.

Table 1 contains an overview of some common density models and the corresponding loss functions as defined by (35), whereas figure 2 contains graphs of the corresponding functions. The only requirement we will impose on $c(x, y, f(x))$ in the following is that for fixed $x$ and $y$ we have convexity in $f(x)$. We impose convexity because we want to ensure the existence and, for strict convexity, the uniqueness of a minimum of the optimization problems [Fletcher, 1989].
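The convexity requirement can be probed numerically for individual loss functions. The sketch below writes the ε-insensitive loss and Huber's robust loss in their commonly used forms (with `sigma` as Huber's threshold, an assumed parameter name) and tests midpoint convexity on a finite grid. Such a test can only refute convexity, not prove it, and `is_midpoint_convex` is a hypothetical helper.

```python
def eps_insensitive(xi, eps):
    """|xi|_eps = max(0, |xi| - eps)."""
    return max(0.0, abs(xi) - eps)


def huber(xi, sigma):
    """Quadratic for |xi| <= sigma, linear beyond; continuous at |xi| = sigma."""
    if abs(xi) <= sigma:
        return xi * xi / (2.0 * sigma)
    return abs(xi) - sigma / 2.0


def is_midpoint_convex(loss, points, tol=1e-12):
    """Check loss((a + b) / 2) <= (loss(a) + loss(b)) / 2 for all grid pairs."""
    return all(
        loss((a + b) / 2.0) <= (loss(a) + loss(b)) / 2.0 + tol
        for a in points
        for b in points
    )


grid = [i / 10.0 for i in range(-30, 31)]

huber_ok = is_midpoint_convex(lambda xi: huber(xi, 1.0), grid)
eps_ok = is_midpoint_convex(lambda xi: eps_insensitive(xi, 0.5), grid)
# A loss of the form |xi|^p with p < 1 fails the check, in line with the
# earlier remark that such losses are nonconvex.
sqrt_ok = is_midpoint_convex(lambda xi: abs(xi) ** 0.5, grid)
```

On this grid the first two losses pass and the square-root loss fails, for example at the midpoint of 0 and 1.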

3.3 Solving the Equations

However, for the sake of simplicity we will additionally assume $c$ to be symmetric and to have (at most) two (for symmetry) discontinuities at $\pm\varepsilon$, $\varepsilon \ge 0$ in the

