首页R语言猴痘数据运用随机森林预测代码

R语言猴痘数据运用随机森林预测代码

时间: 2023-07-24 18:31:07 浏览: 118

以下是使用随机森林模型预测猴痘数据的 R 代码： ```R # 加载所需的包 library(randomForest) # 读取数据 data <- read.csv("monkeypox.csv") # 数据探索 summary(data) # 将数据分为训练集和测试集 set.seed(123) trainIndex <- sample(1:nrow(data), 0.7*nrow(data)) train <- data[trainIndex,] test <- data[-trainIndex,] # 训练随机森林模型 rfModel <- randomForest(outbreak ~ ., data = train, ntree = 500, mtry = 3, importance = TRUE) # 预测测试集数据 predictions <- predict(rfModel, test) # 模型评估 confusion_matrix <- table(predictions, test$outbreak) accuracy <- sum(diag(confusion_matrix))/sum(confusion_matrix) print(paste0("Accuracy: ", round(accuracy, 2))) # 变量重要性分析 varImpPlot(rfModel) ``` 其中，`monkeypox.csv` 是包含猴痘数据的 CSV 文件，`outbreak` 是二元变量，表示是否爆发疫情。代码中将数据分为训练集和测试集，使用 `randomForest` 包中的 `randomForest` 函数训练随机森林模型，`ntree` 和 `mtry` 分别表示树的数量和每个节点随机选取的变量数量，`importance` 表示是否计算变量重要性。最后，代码评估了模型的准确率，并绘制了变量重要性图表。

阅读全文