斯坦福公开课：机器学习导论-模型与算法详解

需积分: 13 130 浏览量更新于2024-07-26 收藏 3.23MB PDF 举报

斯坦福大学开放课程——机器学习，由著名教授Andrew Ng主讲，这门课程深入探讨了机器学习的基础理论和实用算法。课程内容涵盖广泛，从监督学习中的线性回归和最小二乘法，到概率解释和局部加权线性回归，使学生对基础统计建模有了扎实的理解。在第一部分，"Part I Linear Regression"，学生们将学习Least Mean Squares (LMS) 算法，这是求解线性回归问题的一种常用方法。通过矩阵导数讲解最小二乘回归的正常方程，使学生掌握如何通过优化模型参数来最小化误差。这部分还引入了概率解释，帮助学员理解回归问题背后的统计原理。进入分类阶段，"Part II Classification and Logistic Regression"，首先介绍了逻辑回归，这是一种广泛应用在分类任务中的线性模型，其决策边界是由sigmoid函数决定。课程在此处提及了感知机学习算法，作为对比，让学生明白不同模型之间的异同。此外，另一种最大化似然函数的方法也被详细阐述。第三部分，"Generalized Linear Models"，涵盖了指数家族，如Exponential Family，这是统计学中的一个核心概念，用于构建广义线性模型（GLMs）。具体包括线性回归、逻辑回归和softmax回归等模型的构建，这些都是深度理解机器学习模型的重要基石。第四部分，"Generative Learning algorithms"，聚焦于生成式学习，首先是高斯判别分析（GDA），它基于多元正态分布，讲解了该模型的数学基础以及与逻辑回归的对比。接着是朴素贝叶斯分类器，通过拉普拉斯平滑处理数据稀疏性问题，并介绍事件模型在文本分类中的应用。最后，"Part V Support Vector Machines" 是课程的高潮，讨论了支持向量机（SVM）的核心概念，如间隔、函数和几何间隔，以及最优边距分类器的求解策略。课程还涉及拉格朗日乘数法和对偶性，以及在非线性数据上的核技巧，以及正则化的运用以处理非可分问题。SMO（Sequential Minimal Optimization）算法则是这一部分的重点，它是解决大型SVM问题的有效算法。通过这门课程的学习，学生不仅能够掌握基本的机器学习方法，还能理解这些模型背后的数学原理，从而为后续更高级的研究和实践打下坚实的基础。

the training examples’ input values in its rows:

X =







— (x

(1)

)

—

— (x

(2)

)

—

— (x

(m)

)

—







Also, let ~y be the m-dimensional vector containing all the target values from

the training set:

~y =







(1)

(2)

(m)







Now, since h

(i)

) = (x

(i)

)

θ, we can easily verify that

Xθ − ~y =







(1)

)

(m)

)







−







(1)

(m)













(1)

) − y

(1)

(m)

) − y

(m)







Thus, using the fact that for a vector z, we have that z

z =

(Xθ − ~y)

(Xθ − ~y) =

i=1

(i)

) − y

(i)

)

= J(θ)

Finally, to minimize J, lets ﬁnd its derivatives with respect to θ. Combining

Equations (2) and (3), we ﬁnd that

∇

trABA

C = B

+ BA

C (5)

page 10

0 1 2 3 4 5 6 7

0.5

1.5

2.5

3.5

4.5

0 1 2 3 4 5 6 7

0.5

1.5

2.5

3.5

4.5

0 1 2 3 4 5 6 7

0.5

1.5

2.5

3.5

4.5

Instead, if we had added an extra feature x

, and ﬁt y = θ

+ θ

x + θ

then we obtain a slightly better ﬁt to the data. (See middle ﬁgure) Naively, it

might seem that the more features we add, the better. However, there is also

a danger in adding too many features: The rightmost ﬁgure is the result of

ﬁtting a 5-th order polynomial y =

j=0

. We see that even though the

ﬁtted curve passes through the data perfectly, we would not expect this to

be a very good predictor of, say, housing prices (y) for diﬀerent living areas

(x). Without formally deﬁning what these terms mean, we’ll say the ﬁgure

on the left shows an instance of underﬁtting—in which the data clearly

shows structure not captured by the model—and the ﬁgure on the right is

an example of overﬁtting. (Later in this class, when we talk about learning

theory we’ll formalize some of these notions, and also deﬁne more carefully

just what it means for a hypothesis to be good or bad.)

As discussed previously, and as shown in the example above, the choice of

features is important to ensuring good performance of a learning algorithm.

(When we talk about model selection, we’ll also see algorithms for automat-

ically choosing a good set of features.) In this section, let us talk brieﬂy talk

about the locally weighted linear regression (LWR) algorithm which, assum-

ing there is suﬃcient training data, makes the choice of features less critical.

This treatment will be brief, since you’ll get a chance to explore some of the

properties of the LWR algorithm yourself in the homework.

In the original linear regression algorithm, to make a prediction at a query

point x (i.e., to evaluate h(x)), we would:

1. Fit θ to minimize

(i)

− θ

(i)

)

2. Output θ

In contrast, the locally weighted linear regression algorithm does the fol-

lowing:

1. Fit θ to minimize

(i)

− θ

(i)

)

2. Output θ

page 14

剩余225页未读，继续阅读

leewenfeng123

粉丝: 0
资源: 1

斯坦福公开课：机器学习导论-模型与算法详解

《斯坦福大学开放课程: 编程方法》(Open Stanford Course : Programming Methodology)[01-47]

Open Stanford Course : Engineering Everywhere-MachineLearning -- materials

Stanford-Machine-Learning-Course-on-COURSERA：在此资料库中，您将找到我在课程期间完成的演讲幻灯片和编程作业

Stanford-Machine-Learning

Stanford-Machine-Learning-Course:代表机器学习课程的编程练习

matlab说话代码-Ng-Stanford--Machine-Learning:斯坦福机器学习

多元逻辑斯蒂回归matlab代码-stanford-intro-machine-learning:机器学习入门课程（斯坦福）

matlab非参数代码-stanford-machine-learning-course:该存储库包含我用于ML算法实现的Matlab代码。它

matlab代码中向量的点乘-Coursera-Stanford-Machine-Learning-In-Python:CourseraSta

主成分回归代码matlab及例子-Machine-Learning-Stanford-Andrew-Ng:＃MachineLearning（C

最新资源