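Use Python to write a BP network that uses Xavier initialization, regularization, and MSGD (8 inputs, 1 output, 5 hidden nodes), and write a main that feeds the training set into the network for training and then feeds in the test set to obtain the predicted output. The training and test sets are stored in a file named "NEW.xlsx"; use rows 2 through 3001 as the training set and the rows after that as the test set. For both sets, the inputs are in columns [9, 10, 11, 12, 13, 14, 16, 17] and the output is the second-to-last column [-2]. In the main module, plot an R2 figure to show the model's goodness of fit, output the MAE, the MSE, and the mean relative error percentage between predicted and true values, and plot fit line charts for each input and output. The data used in the R2 figure and the line charts must be untransformed data. Do not forget to use normalization (after normalizing, change 0. to 0.01 and 1. to 0.99), Xavier initialization, regularization, and MSGD; use Chinese comments; avoid matrices that cannot be multiplied. Please answer directly and do your best, thank you.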

Time: 2023-12-10 17:41:12 Views: 117

OK, this is a fairly involved problem that has to be solved in several steps; I will explain it in as much detail as I can. First, install the required libraries: pandas, numpy and matplotlib, plus openpyxl, which pandas needs in order to read .xlsx files. In a Jupyter notebook you can use the following command:

```python
!pip install pandas numpy matplotlib openpyxl
```

Next, read the data and normalize it. Note that `pd.read_excel` treats row 1 of the sheet as the header by default, so rows 2 through 3001 of the sheet are the first 3000 data rows:

```python
import pandas as pd
import numpy as np

# Read the data (row 1 of the sheet is assumed to be the header)
data = pd.read_excel('NEW.xlsx')

# Eight input columns followed by the output (second-to-last column)
cols = [9, 10, 11, 12, 13, 14, 16, 17, -2]

# Sheet rows 2-3001 are data rows 0-2999; everything after is the test set
train_data = data.iloc[:3000, cols]
test_data = data.iloc[3000:, cols]

# Min-max normalization to [0.01, 0.99] using training-set statistics only
min_vals = train_data.min(axis=0)
max_vals = train_data.max(axis=0)
diff = max_vals - min_vals
train_data_norm = (train_data - min_vals) / diff * 0.98 + 0.01
test_data_norm = (test_data - min_vals) / diff * 0.98 + 0.01
```

Next, define some constants and functions. LAMBDA is the regularization coefficient, LEARNING_RATE is the learning rate, EPOCHS is the number of passes over the training set, HIDDEN_SIZE is the hidden layer size, BATCH_SIZE is the mini-batch size, ACTIVATION_FUNCTION is the activation function, and DERIVATIVE_ACTIVATION_FUNCTION is its derivative. XAVIER_LIMIT_HIDDEN and XAVIER_LIMIT_OUTPUT are the Xavier initialization limits; the limit is sqrt(6 / (fan_in + fan_out)), so each layer gets its own value.

```python
LAMBDA = 0.001
LEARNING_RATE = 0.01
EPOCHS = 1000
INPUT_SIZE = 8
HIDDEN_SIZE = 5
OUTPUT_SIZE = 1
BATCH_SIZE = 32

# Xavier (Glorot) uniform limits, one per layer
XAVIER_LIMIT_HIDDEN = np.sqrt(6 / (INPUT_SIZE + HIDDEN_SIZE))
XAVIER_LIMIT_OUTPUT = np.sqrt(6 / (HIDDEN_SIZE + OUTPUT_SIZE))

ACTIVATION_FUNCTION = np.tanh
DERIVATIVE_ACTIVATION_FUNCTION = lambda x: 1 - np.tanh(x) ** 2
```

Next, initialize the network's weights and biases:

```python
# Initialize weights (Xavier uniform) and biases (zeros)
weights_input_hidden = np.random.uniform(-XAVIER_LIMIT_HIDDEN, XAVIER_LIMIT_HIDDEN, (INPUT_SIZE, HIDDEN_SIZE))
biases_input_hidden = np.zeros((1, HIDDEN_SIZE))
weights_hidden_output = np.random.uniform(-XAVIER_LIMIT_OUTPUT, XAVIER_LIMIT_OUTPUT, (HIDDEN_SIZE, OUTPUT_SIZE))
biases_hidden_output = np.zeros((1, OUTPUT_SIZE))
```

We also need some helper functions: one to compute the loss, one for forward propagation, one for backpropagation, and one to update the weights and biases. The backward pass takes both weight matrices as arguments, averages the gradient over the batch so that it matches the mean-squared-error loss, and applies the L2 penalty to both weight matrices.

```python
def calculate_loss(predictions, targets):
    # Mean squared error
    return np.mean((predictions - targets) ** 2)

def forward(X, weights_input_hidden, biases_input_hidden, weights_hidden_output, biases_hidden_output, activation_function):
    # Shapes: (n, 8) x (8, 5) -> (n, 5), then (n, 5) x (5, 1) -> (n, 1)
    hidden_layer_input = np.dot(X, weights_input_hidden) + biases_input_hidden
    hidden_layer_output = activation_function(hidden_layer_input)
    output_layer_input = np.dot(hidden_layer_output, weights_hidden_output) + biases_hidden_output
    predictions = output_layer_input  # linear output layer
    return hidden_layer_input, hidden_layer_output, output_layer_input, predictions

def backward(X, targets, hidden_layer_input, hidden_layer_output, predictions, weights_input_hidden, weights_hidden_output, derivative_activation_function, lambd):
    # Gradient of the mean squared error with respect to the output, shape (n, 1)
    output_layer_error = 2 * (predictions - targets) / X.shape[0]
    # Error propagated back to the hidden layer, shape (n, 5)
    hidden_layer_error = np.dot(output_layer_error, weights_hidden_output.T) * derivative_activation_function(hidden_layer_input)
    # L2 regularization adds lambd * W to each weight gradient
    weights_hidden_output_gradient = np.dot(hidden_layer_output.T, output_layer_error) + lambd * weights_hidden_output
    biases_hidden_output_gradient = np.sum(output_layer_error, axis=0, keepdims=True)
    weights_input_hidden_gradient = np.dot(X.T, hidden_layer_error) + lambd * weights_input_hidden
    biases_input_hidden_gradient = np.sum(hidden_layer_error, axis=0, keepdims=True)
    return weights_input_hidden_gradient, biases_input_hidden_gradient, weights_hidden_output_gradient, biases_hidden_output_gradient

def update_weights(weights_input_hidden, biases_input_hidden, weights_hidden_output, biases_hidden_output, weights_input_hidden_gradient, biases_input_hidden_gradient, weights_hidden_output_gradient, biases_hidden_output_gradient, learning_rate):
    # Plain gradient-descent step
    weights_input_hidden -= learning_rate * weights_input_hidden_gradient
    biases_input_hidden -= learning_rate * biases_input_hidden_gradient
    weights_hidden_output -= learning_rate * weights_hidden_output_gradient
    biases_hidden_output -= learning_rate * biases_hidden_output_gradient
    return weights_input_hidden, biases_input_hidden, weights_hidden_output, biases_hidden_output
```

Now we can train the model. The training set is reshuffled and split into mini-batches in every epoch, and the loss and R2 on the whole training set are recorded at the end of each epoch:

```python
import matplotlib.pyplot as plt

# Training inputs (n, 8) and targets (n, 1) as NumPy arrays
X_train = train_data_norm.iloc[:, :-1].values
y_train = train_data_norm.iloc[:, -1].values.reshape(-1, 1)
num_batches = int(np.ceil(len(X_train) / BATCH_SIZE))

# Record the loss and R2 during training
loss_history = []
r2_history = []

# Train the model
for epoch in range(EPOCHS):
    # Reshuffle the samples, then split the indices into mini-batches
    order = np.random.permutation(len(X_train))
    for batch_idx in np.array_split(order, num_batches):
        X_batch = X_train[batch_idx]
        y_batch = y_train[batch_idx]
        hidden_layer_input, hidden_layer_output, output_layer_input, predictions = forward(X_batch, weights_input_hidden, biases_input_hidden, weights_hidden_output, biases_hidden_output, ACTIVATION_FUNCTION)
        weights_input_hidden_gradient, biases_input_hidden_gradient, weights_hidden_output_gradient, biases_hidden_output_gradient = backward(X_batch, y_batch, hidden_layer_input, hidden_layer_output, predictions, weights_input_hidden, weights_hidden_output, DERIVATIVE_ACTIVATION_FUNCTION, LAMBDA)
        weights_input_hidden, biases_input_hidden, weights_hidden_output, biases_hidden_output = update_weights(weights_input_hidden, biases_input_hidden, weights_hidden_output, biases_hidden_output, weights_input_hidden_gradient, biases_input_hidden_gradient, weights_hidden_output_gradient, biases_hidden_output_gradient, LEARNING_RATE)

    # End of epoch: evaluate on the whole training set
    _, _, _, train_predictions = forward(X_train, weights_input_hidden, biases_input_hidden, weights_hidden_output, biases_hidden_output, ACTIVATION_FUNCTION)
    train_loss = calculate_loss(train_predictions, y_train)
    train_r2 = 1 - train_loss / np.var(y_train)
    loss_history.append(train_loss)
    r2_history.append(train_r2)

# Plot R2 against the epoch number
plt.plot(r2_history)
plt.xlabel('Epochs')
plt.ylabel('R2')
plt.show()
```

Next, evaluate the model on the test set and compute R2, MAE and MSE. These values are on the normalized scale:

```python
# Test the model
_, _, _, test_predictions = forward(test_data_norm.iloc[:, :-1].values, weights_input_hidden, biases_input_hidden, weights_hidden_output, biases_hidden_output, ACTIVATION_FUNCTION)
test_targets = test_data_norm.iloc[:, -1].values.reshape(-1, 1)
test_mse = calculate_loss(test_predictions, test_targets)
test_r2 = 1 - test_mse / np.var(test_targets)
test_mae = np.mean(np.abs(test_targets - test_predictions))
print('Test R2:', test_r2)
print('Test MAE:', test_mae)
print('Test MSE:', test_mse)
```

Finally, undo the normalization to recover the real predicted and true values, report the mean relative error, and plot the fit line chart on the original scale. The output is the last of the selected columns, so its statistics are at position -1 of `min_vals` and `diff`:

```python
# Undo the normalization (the output is the last selected column)
test_predictions_real = (test_predictions - 0.01) / 0.98 * diff.iloc[-1] + min_vals.iloc[-1]
test_targets_real = (test_targets - 0.01) / 0.98 * diff.iloc[-1] + min_vals.iloc[-1]

# Mean relative error between predicted and true values, in percent
relative_error = np.mean(np.abs((test_targets_real - test_predictions_real) / test_targets_real)) * 100
print('Relative Error (%):', relative_error)

# Fit line chart of true and predicted output on the original scale
plt.plot(test_targets_real[:, 0], label='True')
plt.plot(test_predictions_real[:, 0], label='Predicted')
plt.xlabel('Samples')
plt.legend()
plt.show()
```

The complete code is as follows:
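Separately from the full listing, a numerical gradient check is a quick way to confirm that a backward pass for an 8-5-1 network has consistent matrix shapes and correct gradients. The sketch below is self-contained and uses random data rather than NEW.xlsx. The function names `forward`, `loss_fn`, `analytic_grads` and `numeric_grad` are hypothetical helpers written for this illustration, and it assumes the objective is mean squared error plus an L2 penalty of 0.5 * lambd * sum(W ** 2) on both weight matrices, with a tanh hidden layer and a linear output.

```python
import numpy as np

def forward(X, W1, b1, W2, b2):
    """tanh hidden layer followed by a linear output layer."""
    z1 = X @ W1 + b1            # (n, 8) @ (8, 5) -> (n, 5)
    a1 = np.tanh(z1)
    return z1, a1, a1 @ W2 + b2  # (n, 5) @ (5, 1) -> (n, 1)

def loss_fn(X, y, W1, b1, W2, b2, lambd):
    """Assumed objective: MSE plus an L2 penalty on both weight matrices."""
    _, _, pred = forward(X, W1, b1, W2, b2)
    penalty = 0.5 * lambd * (np.sum(W1 ** 2) + np.sum(W2 ** 2))
    return np.mean((pred - y) ** 2) + penalty

def analytic_grads(X, y, W1, b1, W2, b2, lambd):
    """Backpropagated gradients of loss_fn for W1, b1, W2, b2."""
    z1, a1, pred = forward(X, W1, b1, W2, b2)
    d_out = 2 * (pred - y) / len(y)                   # (n, 1)
    d_hid = (d_out @ W2.T) * (1 - np.tanh(z1) ** 2)   # (n, 5)
    return (X.T @ d_hid + lambd * W1,
            d_hid.sum(axis=0, keepdims=True),
            a1.T @ d_out + lambd * W2,
            d_out.sum(axis=0, keepdims=True))

def numeric_grad(f, param, eps=1e-6):
    """Central-difference gradient of the scalar f() with respect to param."""
    grad = np.zeros_like(param)
    for idx in np.ndindex(param.shape):
        old = param[idx]
        param[idx] = old + eps
        plus = f()
        param[idx] = old - eps
        minus = f()
        param[idx] = old          # restore the original value
        grad[idx] = (plus - minus) / (2 * eps)
    return grad

# Random stand-in data in the normalized range [0.01, 0.99]
rng = np.random.default_rng(0)
X = rng.uniform(0.01, 0.99, size=(16, 8))
y = rng.uniform(0.01, 0.99, size=(16, 1))

# Xavier uniform initialization, one limit per layer
lim1, lim2 = np.sqrt(6 / (8 + 5)), np.sqrt(6 / (5 + 1))
W1 = rng.uniform(-lim1, lim1, size=(8, 5))
b1 = np.zeros((1, 5))
W2 = rng.uniform(-lim2, lim2, size=(5, 1))
b2 = np.zeros((1, 1))
lambd = 0.001

params = [W1, b1, W2, b2]
f = lambda: loss_fn(X, y, W1, b1, W2, b2, lambd)
max_errors = [float(np.max(np.abs(a - numeric_grad(f, p))))
              for a, p in zip(analytic_grads(X, y, W1, b1, W2, b2, lambd), params)]
print(max_errors)  # largest analytic-vs-numeric difference per parameter
```

If the four printed differences are tiny compared with the gradients themselves, the backward pass agrees with the assumed loss. A large difference usually points to a missing batch average, a missing regularization term, or a transposed matrix.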
