```matlab
function [weight,bias] = SGD(hidenActiFcn,outputActiFcn,weight,bias,nabla_weight,nabla_bias,nlayer,mini_batch_size,eta,a,z,y,lambda,n)
%SGD stochastic gradient descent

% Output layer: error term and gradients averaged over the mini-batch
delta = (a{nlayer}-y).*outputActiFcn(z{nlayer});
nabla_bias{end} = mean(delta,2);
nabla_weight{end} = (delta*a{end-1}')/mini_batch_size;

% Hidden layers: propagate the error term backwards
for in = nlayer-1:-1:2
    delta = weight{in+1}'*delta.*hidenActiFcn(z{in});
    nabla_bias{in} = mean(delta,2);
    nabla_weight{in} = (delta*a{in-1}')/mini_batch_size;
end

% Gradient step; the L2 term shrinks the weights only
for in = 2:nlayer
    weight{in} = (1-eta*lambda/n)*weight{in} - eta*nabla_weight{in};
    bias{in} = bias{in} - eta*nabla_bias{in};
end
end
```

Time: 2024-02-18 18:06:16 Views: 58

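This code implements one parameter update of a neural network trained with stochastic gradient descent (SGD). The function takes the following inputs:

- `hidenActiFcn`: the function applied to the weighted inputs of the hidden layers. Because its result is multiplied elementwise into the error term, it is presumably a handle to the derivative of the hidden-layer activation function;
- `outputActiFcn`: the corresponding function for the output layer, presumably also a derivative for the same reason;
- `weight`: the network's weights, one cell per layer;
- `bias`: the network's biases, one cell per layer;
- `nabla_weight`: cell array for the weight gradients of the current mini-batch (the entries for layers 2 to `nlayer` are overwritten inside the function);
- `nabla_bias`: cell array for the bias gradients of the current mini-batch (overwritten in the same way);
- `nlayer`: the number of layers in the network;
- `mini_batch_size`: the mini-batch size;
- `eta`: the learning rate;
- `a`: the activations of each layer;
- `z`: the weighted inputs of each layer;
- `y`: the expected outputs;
- `lambda`: the L2 regularization coefficient;
- `n`: the size of the training set.

Specifically, the function performs the following steps:

1. Compute the error term of the output layer, `delta = (a{nlayer}-y).*outputActiFcn(z{nlayer})`, then the output layer's bias and weight gradients: `nabla_bias{end} = mean(delta,2); nabla_weight{end} = (delta*a{end-1}')/mini_batch_size;`.
2. Starting from the second-to-last layer and moving backwards to layer 2, compute each layer's error term and gradients, storing them in `delta`, `nabla_bias` and `nabla_weight`.
3. Update the weights and biases of every layer with a gradient descent step.
4. Return the updated weights and biases.

The update in step 3 is:

```matlab
weight{in} = (1-eta*lambda/n)*weight{in} - eta*nabla_weight{in};
bias{in} = bias{in} - eta*nabla_bias{in};
```

Here `in` is the index of the current layer. The factor `(1-eta*lambda/n)` applies L2 regularization (weight decay) to the weights to reduce overfitting; the biases are not regularized.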
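For reference, the steps above can be written as equations. This is only a sketch read off from the code, under a few assumptions that are not stated in the original: $L$ stands for `nlayer`, $m$ for `mini_batch_size`, $W^{l}$ and $b^{l}$ for `weight{l}` and `bias{l}`, $g_{\text{out}}$ and $g_{\text{hid}}$ for whatever `outputActiFcn` and `hidenActiFcn` return (presumably activation derivatives), and each column of $a^{l}$, $z^{l}$ and $y$ is taken to be one sample of a mini-batch with exactly $m$ columns, so that `mean(delta,2)` equals the $1/m$ average.

$$
\begin{aligned}
\delta^{L} &= (a^{L} - y) \odot g_{\text{out}}(z^{L}) \\
\delta^{l} &= \big( (W^{l+1})^{\top} \delta^{l+1} \big) \odot g_{\text{hid}}(z^{l}), \qquad l = L-1, \dots, 2 \\
\nabla_{b}^{l} &= \frac{1}{m} \sum_{k=1}^{m} \delta^{l}_{:,k}, \qquad
\nabla_{W}^{l} = \frac{1}{m}\, \delta^{l} (a^{l-1})^{\top} \\
W^{l} &\leftarrow \Big( 1 - \frac{\eta \lambda}{n} \Big) W^{l} - \eta\, \nabla_{W}^{l}, \qquad
b^{l} \leftarrow b^{l} - \eta\, \nabla_{b}^{l}, \qquad l = 2, \dots, L
\end{aligned}
$$

The first line has the form of the output error for a quadratic cost if $g_{\text{out}}$ is the activation derivative. Note that in MATLAB `*` and `.*` have equal precedence and are evaluated left to right, so `weight{in+1}'*delta.*hidenActiFcn(z{in})` computes the matrix product first and the elementwise product second, matching the second line.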
