自动编码器与稀疏编码在非监督学习中的应用

4星 · 超过85%的资源需积分: 9 174 浏览量更新于2024-07-22 收藏 583KB PDF 举报

"这篇讲义主要探讨了非监督学习中的稀疏自动编码器（sparse Autoencoder），由Andrew Ng讲解，是CS294课程的一部分。讲义提到了监督学习在AI领域的广泛应用，如自动邮编识别、语音识别、自动驾驶等，但同时也指出，监督学习依赖于手动指定输入特征的局限性。" 正文: 稀疏自动编码器（sparse Autoencoder）是一种无监督学习方法，它在机器学习领域中扮演着重要的角色，特别是在计算机视觉、音频处理和自然语言处理等领域。传统的监督学习虽然在许多任务上表现出色，但它需要人类专家对输入数据进行特征工程，这是一个耗时且费力的过程。而稀疏自动编码器的目标是自动学习数据的有效表示，从而减轻或避免人工特征工程的需求。自动编码器（Autoencoder）是一种神经网络模型，它的设计目的是通过学习数据的压缩和解压缩过程来提取数据的潜在特征。它由两部分组成：编码器（Encoder）和解码器（Decoder）。编码器将输入数据映射到一个低维度的隐藏层（latent space），而解码器则尝试从这个隐藏层的表示重构原始输入。在这个过程中，自动编码器试图找到一种能尽可能保留输入信息的高效表示。稀疏自动编码器在常规自动编码器的基础上引入了稀疏性约束。这种约束鼓励隐藏层的神经元只对输入的少数特征有响应，而不是对所有特征都产生激活。这有助于学习更有选择性和更具判别性的特征表示，减少冗余，并可能促进更泛化的学习。通常，稀疏性可以通过在损失函数中添加正则项（如Kullback-Leibler散度）来实现，或者通过调整神经元的激活函数（如使用稀疏激励函数）来直接控制激活的平均数量。在训练过程中，稀疏自动编码器会尝试最小化重构误差，同时保持隐藏层的激活状态稀疏。这种学习过程可以看作是对数据的“去噪”和“压缩”，从而学习到的数据表示更加抽象和有意义。学习到的特征可以用于预处理，提升后续任务如分类或回归的性能，也可以用于降维和数据可视化。稀疏自动编码器提供了一种探索数据内在结构的方法，它可以帮助我们自动发现和学习有效的特征表示，无需手动设计。尽管这种方法在某些情况下可能不如精心设计的特征有效，但其自动化和可扩展性为解决新问题和处理大量数据提供了新的可能。随着深度学习技术的发展，自动编码器，特别是稀疏自动编码器，已经成为研究和应用的重要工具，推动着机器学习向更高级别的自动化和智能迈进。

In this ﬁgure, we have used circles to also denote the inputs to the net-

work. The circles labeled “+1” are called bias units, and correspond to the

intercept term. The leftmost layer of the network is called the input layer,

and the rightmost layer the output layer (which, in this example, has only

one node). The middle layer of nodes is called the hidden layer, because

its values are not observed in the training set. We also say that our example

neural network has 3 input units (not counting the bias unit), 3 hidden

units, and 1 output unit.

We will let n

denote the number of layers in our network; thus n

= 3

in our example. We label layer l as L

, so layer L

is the input layer, and

layer L

the output layer. Our neural network has parameters (W, b) =

(1)

, b

(1)

, W

(2)

, b

(2)

), where we write W

(l)

to denote the parameter (or weight)

associated with the connection between unit j in layer l, and unit i in layer

l+1. (Note the order of the indices.) Also, b

(l)

is the bias associated with unit

i in layer l+1. Thus, in our example, we have W

(1)

∈ R

3×3

, and W

(2)

∈ R

1×3

Note that bias units don’t have inputs or connections going into them, since

they always output the value +1. We also let s

denote the number of nodes

in layer l (not counting the bias unit).

We will write a

(l)

to denote the activation (meaning output value) of

unit i in layer l. For l = 1, we also use a

(1)

= x

to denote the i-th input.

Given a ﬁxed setting of the parameters W, b, our neural network deﬁnes a

hypothesis h

W,b

(x) that outputs a real number. Speciﬁcally, the computation

that this neural network represents is given by:

(2)

= f(W

(1)

+ W

(1)

+ W

(1)

+ b

(1)

) (2)

(2)

= f(W

(1)

+ W

(1)

+ W

(1)

+ b

(1)

) (3)

(2)

= f(W

(1)

+ W

(1)

+ W

(1)

+ b

(1)

) (4)

W,b

(x) = a

(3)

= f(W

(2)

+ W

(2)

+ W

(2)

+ b

(2)

) (5)

In the sequel, we also let z

(l)

denote the total weighted sum of inputs to unit

i in layer l, including the bias term (e.g., z

(2)

j=1

(1)

+ b

(1)

), so that

(l)

= f(z

(l)

Note that this easily lends itself to a more compact notation. Speciﬁcally,

if we extend the activation function f(·) to apply to vectors in an element-

wise fashion (i.e., f([z

, z

]) = [f(z

), f(z

)]), then we can write

剩余18页未读，继续阅读

echea

粉丝: 0
资源: 1

自动编码器与稀疏编码在非监督学习中的应用

使用Sparse Autoencoder训练隐含层网络特征提取

基于Sigmoid函数的Sparse Autoencoder实现教程

C语言内存分配实现及sparse autoencoder特征学习源码解析

Sparse-Autoencoder-Tensorflow-master.rar_autoencoder_tensorflow_

Autoencoder_Code.zip_Autoencoder_Code_matlab autoencoder_自动编码_自动

SAE_deepsae_sae_冗余特征_autoencoder_降维_

tensorflow_stacked_denoising_autoencoder:Tensorflow中堆叠式去噪自动编码器的实现

Sparse-Autoencoder

Exercise10 Sparse Coding.zip_MATLAB 图像分类_sparse_稀疏编码提取_稀疏自编码器_自编

Hyperspectral image classification using k-sparse denoising autoencoder and spectral–restricted spatial characteristics, Applied Soft Computing Journal

最新资源