首页随机森林混淆矩阵代码

随机森林混淆矩阵代码

时间: 2024-06-26 20:01:25 浏览: 257

随机森林混淆矩阵（Confusion Matrix）用于评估分类模型性能，它可以帮助我们理解模型在预测过程中对各个类别的分类效果。在Python中，我们可以使用`sklearn`库中的`confusion_matrix`函数来计算随机森林模型的混淆矩阵。以下是一个简单的例子： ```python from sklearn.ensemble import RandomForestClassifier from sklearn.datasets import make_classification from sklearn.model_selection import train_test_split from sklearn.metrics import confusion_matrix, classification_report # 创建示例数据 X, y = make_classification(n_samples=1000, n_features=10, n_informative=5, random_state=42) # 划分训练集和测试集 X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42) # 训练随机森林模型 rf_model = RandomForestClassifier(random_state=42) rf_model.fit(X_train, y_train) # 预测 y_pred = rf_model.predict(X_test) # 计算混淆矩阵 cm = confusion_matrix(y_test, y_pred) print("Confusion Matrix:\n", cm) # 获取分类报告（包括精确率、召回率等） report = classification_report(y_test, y_pred) print("\nClassification Report:\n", report) ``` 在这个例子中，我们首先创建了一个随机森林分类器，然后用训练数据拟合模型。接着，我们用测试数据进行预测，并计算混淆矩阵来查看模型在各个类别上的表现。分类报告则提供了更详细的性能指标。

阅读全文