
where each row is an orthonormal basis vector $b_i$ with $m$ components. We can consider our naive basis as the effective starting point. All of our data has been recorded in this basis and thus it can be trivially expressed as a linear combination of $\{b_i\}$.
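For concreteness, here is a minimal NumPy sketch of the naive basis (the variable names and toy size are our own, not from the text): with B equal to the identity, each recorded sample is trivially its own coefficient vector.

```python
import numpy as np

m = 6                    # number of measurement types in the toy example
B = np.eye(m)            # naive basis: row i is the standard basis vector b_i
x = np.random.randn(m)   # one recorded sample, already expressed in this basis

# The coefficients of x in the basis {b_i} are just its own components.
coeffs = B @ x
assert np.allclose(coeffs, x)
```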
B. Change of Basis
With this rigor we may now state more precisely what PCA
asks: Is there another basis, which is a linear combination of
the original basis, that best re-expresses our data set?
A close reader might have noticed the conspicuous addition of the word linear. Indeed, PCA makes one stringent but powerful assumption: linearity. Linearity vastly simplifies the problem by restricting the set of potential bases. With this assumption PCA is now limited to re-expressing the data as a linear combination of its basis vectors.
Let X be the original data set, where each column is a single sample (or moment in time) of our data set (i.e., $\vec{X}$). In the toy example X is an $m \times n$ matrix where $m = 6$ and $n = 72000$. Let Y be another $m \times n$ matrix related by a linear transformation P. X is the original recorded data set and Y is a new representation of that data set.
$$PX = Y \qquad (1)$$
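As a concrete sketch of Equation 1 (the use of NumPy, the variable names, and the random orthonormal P are our own illustration; PCA's task will be to choose a particular P, not just any P):

```python
import numpy as np

m, n = 6, 72000                   # dimensions from the toy example
X = np.random.randn(m, n)         # original data set: one sample per column

# Any orthonormal matrix is a valid change of basis; the QR decomposition
# of a random matrix yields one such P as a stand-in.
P, _ = np.linalg.qr(np.random.randn(m, m))

Y = P @ X                         # Equation 1: the re-expressed data set
assert Y.shape == (m, n)
```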
Also let us define the following quantities.¹

• $p_i$ are the rows of P.
• $x_i$ are the columns of X (or individual $\vec{X}$).
• $y_i$ are the columns of Y.

¹ In this section $x_i$ and $y_i$ are column vectors, but be forewarned: in all other sections $x_i$ and $y_i$ are row vectors.
Equation 1 represents a change of basis and thus can have
many interpretations.
1. P is a matrix that transforms X into Y.
2. Geometrically, P is a rotation and a stretch which again transforms X into Y (see the sketch after this list).
3. The rows of P, $\{p_1, \ldots, p_m\}$, are a set of new basis vectors for expressing the columns of X.
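A small sketch of the geometric reading in interpretation 2, using a 2-D rotation and stretch of our own choosing (not from the text):

```python
import numpy as np

theta = np.pi / 4
R = np.array([[np.cos(theta), -np.sin(theta)],   # pure rotation by theta
              [np.sin(theta),  np.cos(theta)]])
S = np.diag([2.0, 0.5])                          # pure stretch along each axis

X = np.random.randn(2, 100)                      # 2-D toy data, one point per column
Y = (S @ R) @ X                                  # rotate, then stretch

# A pure rotation preserves the length of every data point; the stretch
# then rescales the new axes.
assert np.allclose(np.linalg.norm(R @ X, axis=0), np.linalg.norm(X, axis=0))
```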
The latter interpretation is not obvious but can be seen by writing out the explicit dot products of PX.
$$
PX = \begin{bmatrix} p_1 \\ \vdots \\ p_m \end{bmatrix}
\begin{bmatrix} x_1 & \cdots & x_n \end{bmatrix}
$$

$$
Y = \begin{bmatrix}
p_1 \cdot x_1 & \cdots & p_1 \cdot x_n \\
\vdots & \ddots & \vdots \\
p_m \cdot x_1 & \cdots & p_m \cdot x_n
\end{bmatrix}
$$
We can note the form of each column of Y.
$$
y_i = \begin{bmatrix} p_1 \cdot x_i \\ \vdots \\ p_m \cdot x_i \end{bmatrix}
$$
We recognize that each coefficient of $y_i$ is a dot product of $x_i$ with the corresponding row in P. In other words, the $j$-th coefficient of $y_i$ is a projection onto the $j$-th row of P. This is in fact the very form of an equation where $y_i$ is a projection onto the basis of $\{p_1, \ldots, p_m\}$. Therefore, the rows of P are a new set of basis vectors for representing the columns of X.
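This projection reading is easy to check numerically; a sketch under the same assumed NumPy setup as above:

```python
import numpy as np

m, n = 6, 10
X = np.random.randn(m, n)
P, _ = np.linalg.qr(np.random.randn(m, m))   # stand-in orthonormal basis
Y = P @ X

# The j-th coefficient of column y_i is the dot product of x_i with row p_j,
# i.e. the projection of x_i onto the j-th new basis vector.
i, j = 3, 2
assert np.isclose(Y[j, i], P[j] @ X[:, i])
```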
C. Questions Remaining
By assuming linearity the problem reduces to finding the appropriate change of basis. The row vectors $\{p_1, \ldots, p_m\}$ in this transformation will become the principal components of X. Several questions now arise.
• What is the best way to re-express X?
• What is a good choice of basis P?
These questions must be answered by next asking ourselves what features we would like Y to exhibit. Evidently, additional assumptions beyond linearity are required to arrive at a reasonable result. The selection of these assumptions is the subject of the next section.
IV. VARIANCE AND THE GOAL
Now comes the most important question: what does "best express the data" mean? This section will build up an intuitive answer to this question and along the way tack on additional assumptions.
A. Noise and Rotation
Measurement noise in any data set must be low or else, no
matter the analysis technique, no information about a signal