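Apply a decision tree regression algorithm to the bike-sharing data, following the basic data-processing workflow: 1. Build an initial model. 2. Tune hyperparameters (plot a learning curve to tune the depth). 3. Report model stability from two kinds of cross-validation (both R² and mean squared error). 4. Take the first 100 samples from the test set and plot the fitted curve, with every point explicitly marked.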

Posted: 2024-06-05 11:06:53

Solving a regression problem with a decision tree

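Below is example code that applies decision tree regression to the bike-sharing data:

    # Import the required modules
    import numpy as np
    import pandas as pd
    import matplotlib.pyplot as plt
    from sklearn.tree import DecisionTreeRegressor
    from sklearn.model_selection import train_test_split, cross_val_score, learning_curve
    from sklearn.metrics import r2_score, mean_squared_error

    # Load the data
    data = pd.read_csv('bike.csv')

    # Split features and label: the first 11 columns are used as features
    # and the last column as the target
    X = data.iloc[:, 0:11]
    y = data.iloc[:, -1]

    # Split into training and test sets
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

    # 1. Initial model (default hyperparameters)
    dt = DecisionTreeRegressor(random_state=0)
    dt.fit(X_train, y_train)
    print('Initial test R2:', dt.score(X_test, y_test))

    # 2. Tuning
    # Learning curve over training-set size. learning_curve returns three
    # arrays: absolute training sizes, training scores, validation scores.
    train_sizes, train_scores, val_scores = learning_curve(
        dt, X_train, y_train, cv=10, scoring='r2',
        train_sizes=np.linspace(0.1, 1.0, 10))
    train_mean = np.mean(train_scores, axis=1)
    train_std = np.std(train_scores, axis=1)
    val_mean = np.mean(val_scores, axis=1)
    val_std = np.std(val_scores, axis=1)
    plt.plot(train_sizes, train_mean, marker='o', label='Training score')
    plt.plot(train_sizes, val_mean, marker='o', label='Cross-validation score')
    plt.fill_between(train_sizes, train_mean - train_std, train_mean + train_std, alpha=0.2)
    plt.fill_between(train_sizes, val_mean - val_std, val_mean + val_std, alpha=0.2)
    plt.xlabel('Training set size')
    plt.ylabel('R2 score')
    plt.legend(loc='best')
    plt.show()

    # Depth curve. Each depth is scored by cross-validation on the training
    # set, so the test set is not used for model selection.
    depths = list(range(1, 21))
    depth_train_r2, depth_cv_r2 = [], []
    for depth in depths:
        model = DecisionTreeRegressor(max_depth=depth, random_state=0)
        model.fit(X_train, y_train)
        depth_train_r2.append(model.score(X_train, y_train))
        depth_cv_r2.append(cross_val_score(model, X_train, y_train, cv=10, scoring='r2').mean())
    plt.plot(depths, depth_train_r2, marker='o', label='Training score')
    plt.plot(depths, depth_cv_r2, marker='o', label='Cross-validation score')
    plt.xlabel('Depth of tree')
    plt.ylabel('R2 score')
    plt.legend(loc='best')
    plt.show()

    # Refit with the best depth. Without this step, the model left over from
    # the loop would be the depth-20 tree.
    best_depth = depths[int(np.argmax(depth_cv_r2))]
    print('Best depth:', best_depth)
    dt = DecisionTreeRegressor(max_depth=best_depth, random_state=0)
    dt.fit(X_train, y_train)

    # 3. Cross-validation results (R2 and MSE) for the tuned model
    cv_r2 = cross_val_score(dt, X_train, y_train, cv=10, scoring='r2')
    # scikit-learn reports MSE as a negative score, so flip the sign
    cv_mse = -cross_val_score(dt, X_train, y_train, cv=10, scoring='neg_mean_squared_error')
    print('Cross-validation R2 scores:', cv_r2)
    print('Mean R2 score:', np.mean(cv_r2))
    print('Cross-validation MSE scores:', cv_mse)
    print('Mean cross-validation MSE:', np.mean(cv_mse))
    y_pred = dt.predict(X_test)
    print('Test R2:', r2_score(y_test, y_pred))
    print('Test MSE:', mean_squared_error(y_test, y_pred))

    # 4. Fitted curve for the first 100 test samples, with every point marked
    y_true_100 = y_test.iloc[:100].to_numpy()
    y_pred_100 = y_pred[:100]
    idx = np.arange(len(y_true_100))
    plt.plot(idx, y_true_100, 'b-o', markersize=4, label='Actual')
    plt.plot(idx, y_pred_100, 'r--s', markersize=4, label='Prediction')
    plt.xlabel('Sample index')
    plt.ylabel('Count')
    plt.legend(loc='best')
    plt.show()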
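The question asks for stability results from two kinds of cross-validation, while the code above runs a single 10-fold pass. The following is a minimal sketch of one way to cover both, assuming the two intended schemes are K-fold and repeated random splits (`ShuffleSplit`); the question does not say which two are meant. It uses synthetic data from `make_regression` as a stand-in for `bike.csv`, and `max_depth=8` and the helper name `cv_summary` are placeholders for illustration only.

```python
from sklearn.datasets import make_regression
from sklearn.model_selection import KFold, ShuffleSplit, cross_validate
from sklearn.tree import DecisionTreeRegressor

# Synthetic stand-in for the bike data (assumption: 11 numeric features)
X, y = make_regression(n_samples=500, n_features=11, noise=10.0, random_state=0)

# Placeholder depth; in practice use the depth picked from the depth curve
model = DecisionTreeRegressor(max_depth=8, random_state=0)

# Two cross-validation schemes: 10 disjoint folds vs. 10 random 80/20 splits
splitters = {
    'KFold': KFold(n_splits=10, shuffle=True, random_state=0),
    'ShuffleSplit': ShuffleSplit(n_splits=10, test_size=0.2, random_state=0),
}


def cv_summary(estimator, X, y, cv):
    """Return mean and standard deviation of R2 and MSE over the CV splits."""
    res = cross_validate(estimator, X, y, cv=cv,
                         scoring=['r2', 'neg_mean_squared_error'])
    r2 = res['test_r2']
    mse = -res['test_neg_mean_squared_error']  # flip sign back to plain MSE
    return {
        'n_splits': len(r2),
        'r2_mean': r2.mean(), 'r2_std': r2.std(),
        'mse_mean': mse.mean(), 'mse_std': mse.std(),
    }


results = {name: cv_summary(model, X, y, cv) for name, cv in splitters.items()}
for name, s in results.items():
    print(f"{name}: R2 = {s['r2_mean']:.3f} +/- {s['r2_std']:.3f}, "
          f"MSE = {s['mse_mean']:.1f} +/- {s['mse_std']:.1f}")
```

A small standard deviation across splits, under both schemes, suggests the model is stable. To apply this to the real data, replace `X` and `y` with the training features and labels loaded from `bike.csv`.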
