plt.plot(np.arange(len(Y_pred)), X_train[:, -1], '->')

This code uses the matplotlib library to create a line plot. The x-axis values are generated using the numpy arange function, which creates an array of evenly spaced values from 0 to the length of Y_pred. The y-axis values are taken from the last column of the X_train dataset, which is accessed using the slicing notation [:, -1]. The plot is created using the '->' argument, which specifies that the plot should use a solid line with triangular markers pointing in the positive x-direction.

depth = np.arange(1, 15) err_train_list = [] err_test_list = [] clf = DecisionTreeClassifier(criterion='entropy') for d in depth: clf.set_params(max_depth=d) clf.fit(x_train, y_train) y_train_pred = clf.predict(x_train) err_train = 1-accuracy_score(y_train, y_train_pred) err_train_list.append(err_train) y_test_pred = clf.predict(x_test) err_test = 1-accuracy_score(y_test, y_test_pred) err_test_list.append(err_test) print(d, '测试集错误率：%.2f%%' % (100 * err_test)) plt.figure(facecolor='w') plt.plot(depth, err_test_list, 'ro-', markeredgecolor='k', lw=2, label='测试集错误率') plt.plot(depth, err_train_list, 'go-', markeredgecolor='k', lw=2, label='训练集错误率') plt.xlabel('决策树深度', fontsize=13) plt.ylabel('错误率', fontsize=13) plt.legend(loc='lower left', fontsize=13) plt.title('决策树深度与过拟合', fontsize=15) plt.grid(b=True, ls=':', color='#606060') depth = np.arange(1, 15) err_train_list = [] err_test_list = [] clf = DecisionTreeClassifier(criterion='entropy') for d in depth: clf.set_params(max_depth=d) clf.fit(x_train, y_train) y_train_pred = clf.predict(x_train) err_train = 1-accuracy_score(y_train, y_train_pred) err_train_list.append(err_train) y_test_pred = clf.predict(x_test) err_test = 1-accuracy_score(y_test, y_test_pred) err_test_list.append(err_test) print(d, '测试集错误率：%.2f%%' % (100 * err_test)) plt.figure(facecolor='w') plt.plot(depth, err_test_list, 'ro-', markeredgecolor='k', lw=2, label='测试集错误率') plt.plot(depth, err_train_list, 'go-', markeredgecolor='k', lw=2, label='训练集错误率') plt.xlabel('决策树深度', fontsize=13) plt.ylabel('错误率', fontsize=13) plt.legend(loc='lower left', fontsize=13) plt.title('决策树深度与过拟合', fontsize=15) plt.grid(b=True, ls=':', color='#606060') plt.show()

这段代码是用于通过决策树深度来观察决策树的过拟合情况。代码首先定义了一个范围为1到15的深度列表depth，然后创建了两个空列表err_train_list和err_test_list，用于存储训练集和测试集的错误率。接下来，使用决策树分类器DecisionTreeClassifier，并设置criterion为'entropy'，创建了一个决策树模型clf。然后，使用for循环遍历深度列表depth，每次将当前深度d设置为clf的最大深度，并使用x_train和y_train进行拟合。然后，分别对训练集和测试集进行预测，并计算错误率，将错误率添加到对应的列表中。最后，使用matplotlib库绘制了深度与错误率的图形，并显示出来。这段代码可以帮助我们观察决策树在不同深度下的过拟合情况，通过观察错误率的变化，可以选择一个合适的深度来构建决策树模型。

depth = np.arange(1, 15) err_train_list = [] err_test_list = [] clf = DecisionTreeClassifier(criterion='entropy') for d in depth: clf.set_params(max_depth=d) clf.fit(x_train, y_train) y_train_pred = clf.predict(x_train) err_train = 1-accuracy_score(y_train, y_train_pred) err_train_list.append(err_train) y_test_pred = clf.predict(x_test) err_test = 1-accuracy_score(y_test, y_test_pred) err_test_list.append(err_test) print(d, '测试集错误率：%.2f%%' % (100 * err_test)) plt.figure(facecolor='w') plt.plot(depth, err_test_list, 'ro-', markeredgecolor='k', lw=2, label='测试集错误率') plt.plot(depth, err_train_list, 'go-', markeredgecolor='k', lw=2, label='训练集错误率') plt.xlabel('决策树深度', fontsize=13) plt.ylabel('错误率', fontsize=13) plt.legend(loc='lower left', fontsize=13) plt.title('决策树深度与过拟合', fontsize=15) plt.grid(b=True, ls=':', color='#606060') plt.show()

这段代码是一个决策树的深度与过拟合关系的实验。代码首先定义了一个从1到14的深度范围，并初始化了两个空列表err_train_list和err_test_list来保存训练集和测试集的错误率。然后创建了一个DecisionTreeClassifier对象clf，并设置其criterion参数为'entropy'，即使用信息熵作为分裂标准。接下来的循环中，根据不同深度设置clf的max_depth参数，并使用训练集x_train和y_train进行拟合。然后分别对训练集和测试集进行预测，计算错误率并将其添加到对应的列表中。最后，打印出每个深度对应的测试集错误率。代码的最后部分使用matplotlib库绘制了一个图表，横坐标为深度，纵坐标为错误率。其中红色线表示测试集的错误率，绿色线表示训练集的错误率。图表的标题是"决策树深度与过拟合"，横轴标签是"决策树深度"，纵轴标签是"错误率"。通过这个实验，可以观察到决策树深度与过拟合之间的关系。一般来说，随着深度的增加，模型在训练集上的错误率会逐渐降低，但在测试集上的错误率可能会先降低后增加，这就是过拟合的现象。通过观察图表，可以找到一个合适的深度，使得模型在训练集和测试集上的错误率都较低。

阅读全文

plt.plot(np.arange(len(Y_pred)), X_train[:, -1], '->')

相关推荐

1_题目.docx

详解使用python绘制混淆矩阵（confusion_matrix）

支持向量机教程.docx

plt.plot(scaler.inverse_transform(y_test_pred.detach().numpy()), "b") plt.plot(scaler.inverse_transform(y_test_tensor.detach().numpy().reshape(-1, 1)), "r") plt.legend() plt.show()以上函数的X轴相距为1，怎么将X轴相距改为0.06

def plot_confuse(model, x_val, y_val): predictions = model.predict_classes(x_val) truelabel = y_val.argmax(axis=-1) # 将one-hot转化为label conf_mat = confusion_matrix(y_true=truelabel, y_pred=predictions) plt.figure() plot_confusion_matrix(conf_mat, range(np.max(truelabel) + 1))

基于Andorid的音乐播放器项目改进版本设计.zip

uniapp-machine-learning-from-scratch-05.rar

大家在看

zlg的Python应用

UART.rar_2407 串口_F2407_TMS320LF2407_uart c语言

cam350导出smt坐标

TA-Lib的whl文件

本科-OOAD-原婷婷-2015212109-188013989281

最新推荐

基于Andorid的音乐播放器项目改进版本设计.zip

uniapp-machine-learning-from-scratch-05.rar

game_patch_1.30.21.13250.pak

Windows下操作Linux图形界面的VNC工具

【SketchUp Ruby API：从入门到精通】

VMware虚拟机打开虚拟网络编辑器出现由于找不到vnetlib.dll,无法继续执行代码。重新安装程序可能会解决问题

基于Preact的高性能PWA实现定期天气信息更新

从停机到上线，EMC VNX5100控制器SP更换的实战演练

ubuntu labelme中文版安装

全新免费HTML5商业网站模板发布