tf.Variable ,tf.matmul, tf.reduce_sum这三种方法是什么意思

这三种方法都是 TensorFlow 中的函数。 1. tf.Variable：用于创建可训练的变量，可以用来存储模型的参数，例如权重和偏置项等。创建后，变量可以通过 TensorFlow 的优化器进行训练和更新。 2. tf.matmul：用于进行矩阵乘法操作。矩阵乘法是深度学习中常用的操作，例如在神经网络中，用于计算输入数据和权重矩阵之间的乘积。 3. tf.reduce_sum：用于计算张量的元素之和，可以沿着指定的轴进行计算。在深度学习中，通常用于计算损失函数的值，例如交叉熵损失函数的计算。

X_parameters = tf.Variable(tf.random_normal([bookNo, num_features],stddev = 0.35)) Theta_parameters = tf.Variable(tf.random_normal([userNo, num_features],stddev = 0.35)) optimizer = tf.train.AdamOptimizer(1e-4) loss = 1/2 * tf.reduce_sum(((tf.matmul(X_parameters, Theta_parameters, transpose_b = True) - rating_norm) * record) ** 2) + 1/2 * (tf.reduce_sum(X_parameters 2) + tf.reduce_sum(Theta_parameters 2)) train = optimizer.minimize(loss) tf.summary.scalar('loss', loss)

这段代码实现了一个基于TensorFlow的矩阵分解推荐算法。首先，它定义了两个变量X_parameters和Theta_parameters，它们分别表示书籍和用户的特征矩阵。这两个矩阵的大小分别为[书籍数量，特征数量]和[用户数量，特征数量]。这里使用tf.Variable()函数将它们定义为可训练的变量，并且使用tf.random_normal()函数初始化它们的值为符合正态分布的随机数。然后，它定义了一个Adam优化器，用于最小化损失函数。这里使用的学习率为1e-4。接着，它定义了损失函数。这个损失函数是一个基于矩阵分解的推荐算法的常见形式，包括两个部分：一个是预测评分与真实评分之间的差异（差的平方），另一个是正则化项（防止过拟合）。具体来说，它使用tf.matmul()函数计算X_parameters和Theta_parameters的乘积，得到预测的评分矩阵，然后使用tf.reduce_sum()函数计算预测评分与真实评分之间的差异（差的平方），并乘以record矩阵（即评分矩阵是否有值的矩阵，用于过滤掉没有评分的项）。最后，它将正则化项（防止过拟合）加入到损失函数中，其中使用tf.reduce_sum()函数计算X_parameters和Theta_parameters的平方和，并将它们相加。然后，它使用optimizer.minimize()函数最小化损失函数，以更新X_parameters和Theta_parameters矩阵的值。同时，使用tf.summary.scalar()函数定义了一个损失值的摘要（summary），以便在训练过程中可视化损失函数的变化。

# 设置输入层节点数、隐层节点数 in_nodes=784 h1_nodes=100 h2_nodes=100 h3_nodes=50 # 定义输入、输出、prob的placeholder x=tf.placeholder(tf.float32,[None,in_nodes]) y_=tf.placeholder(tf.float32,[None,10]) prob=tf.placeholder(tf.float32) # 设置第一隐层 w1=weight(in_nodes, h1_nodes, 0.1, 0.005) b1=tf.Variable(tf.zeros([h1_nodes])) h1=tf.nn.relu(tf.matmul(x,w1)+b1) # 设置第二隐层 w2=weight(h1_nodes, h2_nodes, 0.1, 0.0) b2=tf.Variable(tf.zeros([h2_nodes])) h2=tf.nn.relu(tf.matmul(h1,w2)+b2) h2_drop=tf.nn.dropout(h2, prob) # 设置第三隐层 w3=weight(h2_nodes, h3_nodes, 0.1, 0.0) b3=tf.Variable(tf.zeros([h3_nodes])) h3=tf.nn.relu(tf.matmul(h2_drop,w3)+b3) h3_drop=tf.nn.dropout(h3, prob) # 设置softmax输出层 w4=weight(h3_nodes, 10, 0.1, 0.0) b4=tf.Variable(tf.zeros([10])) y=tf.nn.softmax(tf.matmul(h3_drop,w4)+b4)

将上述代码改写成 TensorFlow 2.x 版本的代码如下： ```python import tensorflow as tf def weight(shape, stddev, wd): initial = tf.random.truncated_normal(shape, stddev=stddev) var = tf.Variable(initial) if wd is not None: weight_decay = tf.multiply(tf.nn.l2_loss(var), wd, name='weight_loss') tf.add_to_collection('losses', weight_decay) return var class MyModel(tf.keras.Model): def __init__(self, in_nodes=784, h1_nodes=100, h2_nodes=100, h3_nodes=50): super(MyModel, self).__init__() self.w1 = weight([in_nodes, h1_nodes], 0.1, 0.005) self.b1 = tf.Variable(tf.zeros([h1_nodes])) self.w2 = weight([h1_nodes, h2_nodes], 0.1, 0.0) self.b2 = tf.Variable(tf.zeros([h2_nodes])) self.w3 = weight([h2_nodes, h3_nodes], 0.1, 0.0) self.b3 = tf.Variable(tf.zeros([h3_nodes])) self.w4 = weight([h3_nodes, 10], 0.1, 0.0) self.b4 = tf.Variable(tf.zeros([10])) def call(self, inputs, prob): x = inputs y_ = tf.cast(inputs, tf.float32) h1 = tf.nn.relu(tf.matmul(x, self.w1) + self.b1) h2 = tf.nn.relu(tf.matmul(h1, self.w2) + self.b2) h2_drop = tf.nn.dropout(h2, rate=prob) h3 = tf.nn.relu(tf.matmul(h2_drop, self.w3) + self.b3) h3_drop = tf.nn.dropout(h3, rate=prob) y = tf.nn.softmax(tf.matmul(h3_drop, self.w4) + self.b4) return y model = MyModel() x = tf.keras.Input(shape=(None, 784)) prob = tf.keras.Input(shape=()) y = model(x, prob) y_ = tf.keras.Input(shape=(None, 10)) # 定义损失函数 cross_entropy = tf.reduce_mean(-tf.reduce_sum(y_ * tf.math.log(y), reduction_indices=[1])) tf.add_to_collection('losses', cross_entropy) loss = tf.add_n(tf.get_collection('losses')) # 定义优化器 train_step = tf.train.AdamOptimizer(1e-4).minimize(loss) # 训练模型 with tf.Session() as sess: sess.run(tf.global_variables_initializer()) for i in range(1000): batch_xs, batch_ys = mnist.train.next_batch(100) sess.run(train_step, feed_dict={x: batch_xs, y_: batch_ys, prob: 0.5}) ``` 在 TensorFlow 2.x 中，可以使用 `tf.reduce_mean` 和 `tf.reduce_sum` 函数来计算张量的平均值和总和；使用 `tf.math.log` 函数来计算张量的自然对数。此外，可以使用 `tf.train.AdamOptimizer` 来定义优化器，使用 `model.trainable_variables` 来获取所有可训练的变量。

阅读全文

tf.Variable ,tf.matmul, tf.reduce_sum这三种方法是什么意思

相关推荐

CUDA11.0-C-Programming-Guide.pdf

深度学习作业1.docx

TensorFlow的常量.变量.常用函数.docx

MNIST_data

请用trensorflow计算1+2+3+……+n，要求要使用tf.variable,tf.constant,tf.placeholder,并且n要用户输入

通过Tensorflow来进行计算1+2+…+10，必须要用到tf.Variable、tf.constant

通过Tensorflow来进行计算1+2+...+10，必须要用到tf.Variable、tf.constant，正确定义了相关常量、变量和操作共5分;

用tensorflow编写基于梯度的动态权重代码，权重用tf.Variable定义，未整合到模型中

gen_nccl_ops.nccl_all_reduce()具体的参数和用法

attention lstm tf代码实现

tf 实现lstm + self-attention

python神经网络训练完了怎么输入得到结果_tensorflow学习笔记之简单的神经网络训练和测试...

This needs gradients relative to img什么意思

u.weights是一个张量，我要把他除以它的和，怎么做

内墙装修涂料行业发展趋势：预计2030年年复合增长率（CAGR）为5.6%（2024-2030）

大家在看

计算机组成与体系结构(性能设计)答案完整版-第八版

蓝牙室内定位服务源码！

如何降低开关电源纹波噪声

S7-200处理定时中断.zip西门子PLC编程实例程序源码下载

国自然标书医学下载国家自然科学基金面上课题申报中范文模板2023

最新推荐

python用TensorFlow做图像识别的实现

内墙装修涂料行业发展趋势：预计2030年年复合增长率（CAGR）为5.6%（2024-2030）

HTML挑战：30天技术学习之旅

【CodeBlocks精通指南】：一步到位安装wxWidgets库（新手必备）

andorid studio 配置ERROR: Cause: unable to find valid certification path to requested target

VC++实现文件顺序读写操作的技巧与实践

【大数据时代必备：Hadoop框架深度解析】：掌握核心组件，开启数据科学之旅

opencv的demo程序

NeuronTransportIGA: 使用IGA进行神经元材料传输模拟

【Linux多系统管理大揭秘】：专家级技巧助你轻松驾驭