Tensor Ring Restricted Boltzmann Machines
Maolin Wang, Chenbin Zhang, Yu Pan, Jing Xu and Zenglin Xu
SMILE Lab, School of Computer Science and Engineering
University of Electronic Science and Technology of China
Chengdu, Sichuan, China
Email: {morin.w98, aleczhang13, ypyupan, xujing.may, zenglin}@gmail.com
Abstract—Restricted Boltzmann Machines (RBMs) are important and useful generative models which learn a probability distribution from a set of vector inputs. Despite their success in a number of applications, standard RBMs are designed for vectorized inputs and are incapable of dealing with high-order data, since vectorization of high-order data may cause both the collapse of data modes and explosive parameter growth. To address this issue, we formulate a new tensor-input RBM model, which employs the tensor ring (TR) decomposition structure to naturally represent the high-order relationship between the visible layer and the hidden layer. For convenience, we name the proposed model TR-RBM. In particular, the tensor ring decomposition enjoys many good properties, such as rank stability, leading to better generalization performance compared with other low-rank decomposition methods. Moreover, TR-RBM reduces the complexity of the RBM by reshaping both the visible and hidden layers into tensor forms, leading to a significant drop in parameter size. Experimental results in comparison with the classical RBM and the Matrix-Product-Operator RBM show the promising performance of the proposed method in the tasks of feature extraction and denoising.
Index Terms—tensors, tensor decomposition, tensor ring, Restricted Boltzmann Machines, feature extraction
I. INTRODUCTION
Restricted Boltzmann Machines (RBMs) are generative models which can learn a probability distribution from a set of inputs [1]. Due to their powerful ability to extract features, RBMs have been widely used in speech recognition [2], collaborative filtering [3], network anomaly detection [4], and computer vision [5].
A standard RBM is specifically designed for vector input data and is not efficient for matrices or higher-dimensional arrays, which are very common in many applications. A common workaround is to flatten high-dimensional data into a one-dimensional vector before feeding it to the model. However, this method ignores the relationships between different data modes and may lead to the curse of dimensionality, i.e., an explosion in the number of corresponding parameters [6].
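As a rough illustration of this parameter explosion, the following sketch (with hypothetical layer sizes of our own choosing, not taken from the paper) counts the weights of a dense RBM applied to a flattened multi-mode input:

```python
import numpy as np

# Hypothetical sizes for illustration only.
visible_shape = (64, 64, 3)   # e.g., a small RGB image
hidden_units = 1024

n_visible = int(np.prod(visible_shape))   # 12288 units after flattening
n_weights = n_visible * hidden_units      # entries of the dense weight matrix W
print(f"flattened visible units: {n_visible}")
print(f"dense RBM weights:       {n_weights:,}")  # 12,582,912
```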
A promising way to generalize vector-input models to high-order inputs is to apply tensorized models or low-rank tensor decompositions. Due to the good properties and successful applications of the Tensor Ring Decomposition (TRD) [7]–[9] in convolutional neural networks [8], [10] and recurrent neural networks [9], [11], [12], we propose the Tensor-Ring Restricted Boltzmann Machine (TR-RBM) model. In detail, the weight matrix is reshaped into a high-order tensor and then decomposed using TRD, so that the correlation information among data modes can be maintained and utilized. Powered by the favorable properties of TRD, our model is expected to extract information across data modes more effectively while using fewer parameters.

We thank the anonymous reviewers for their valuable comments, which helped improve the quality of our paper. This work was partially supported by the National Natural Science Foundation of China (Nos. 61572111 and 61876034) and a Fundamental Research Fund for the Central Universities of China (No. ZYGX2016Z003).
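To make the TRD structure concrete, here is a minimal, self-contained sketch (our own illustrative code, not the paper's implementation) that reconstructs a full tensor from tensor-ring cores and compares parameter counts. Each core G[k] has shape (r, n_k, r), and entry T[i1, ..., id] is the trace of the product of the slice matrices G[k][:, i_k, :].

```python
import numpy as np

def tr_reconstruct(cores):
    """Contract tensor-ring cores back into the full tensor."""
    full = cores[0]                       # shape (r, n1, r)
    for core in cores[1:]:
        # contract the shared bond dimension with the next core
        full = np.tensordot(full, core, axes=([-1], [0]))
    # close the ring: trace over the first and last bond indices
    return np.trace(full, axis1=0, axis2=-1)

rank, shape = 3, (8, 8, 8, 8)             # illustrative rank and mode sizes
cores = [np.random.randn(rank, n, rank) for n in shape]
T = tr_reconstruct(cores)
print(T.shape)                             # (8, 8, 8, 8)
n_tr = sum(c.size for c in cores)          # 288 TR parameters
print(n_tr, "TR params vs", int(np.prod(shape)), "full entries")  # 288 vs 4096
```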
This article makes the following contributions:
• We are the first to apply the tensor ring decomposition structure to RBMs. The classical RBM, the matrix-variate RBM (MvRBM), and the tensor-variate RBM (TvRBM) can be regarded as special cases of TR-RBM. Moreover, the computational complexity and flexibility of TR-RBM are better than those of the matrix-product-operator (MPO) RBM.
• The number of parameters of TR-RBM is highly compressed, so the explosion of RBM parameters with the order of the input data can be avoided.
• An alternating optimization algorithm for TR-RBM is designed, and its space and time complexity are provided.
The rest of the paper is organized as follows. Section II presents a review of the related literature and highlights the properties of the Tensor Ring Decomposition. Section III gives some preliminaries on RBMs and tensors. In Section IV, we describe our TR-RBM model, introduce the learning algorithm, and analyze its complexity. In Section V, we conduct a series of experiments to evaluate the performance of the proposed method. Finally, Section VI concludes this paper.
II. RELATED WORKS
Many real-world data are multi-mode, e.g., patient drug responses with four modes (person, medicine, biomarker, time) [13]. Vectorizing multi-mode data is a common approach, but it may lose relational information among modes and thus degrade model performance. In addition, this approach suffers from a large number of parameters as the dimensionality of the input data increases [14]. In contrast, tensor decompositions of higher-mode data can capture higher-order correlations while maintaining fewer parameters. Therefore, research on low-rank tensor structures has attracted great attention.
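As a rough illustration of these savings, the following sketch uses the standard parameter-counting formulas for the low-rank formats reviewed next; the order, mode size, and ranks are hypothetical values of our own choosing.

```python
# Parameter counts for an order-d tensor with uniform mode size n,
# under a uniform rank r for each format (standard formulas).
d, n, r = 4, 10, 5

full   = n ** d                               # dense tensor: 10000 entries
cp     = d * n * r                            # CP factors:   200
tucker = d * n * r + r ** d                   # factors+core: 825
tt     = 2 * n * r + (d - 2) * n * r * r      # TT (boundary ranks 1): 600
tr     = d * n * r * r                        # TR (all bond ranks r): 1000
print(full, cp, tucker, tt, tr)
```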
Standard low-rank tensor structures include the Tucker decomposition [15], CANDECOMP/PARAFAC (CP) [16], [17], pair-wise decomposition [18], [19], InfTucker and its variations [13], [20]–[22], tensor train decomposition (TTD) [23],