for train_id, test_id in skf.split(x, y): print("============{}-th cross validation============".format(i)) X_train, X_test = x[train_id,:], x[test_id,:]怎么变成slices

你可以使用 Python 中的切片（slicing）来实现对数据的切割。例如，如果你想把一个数组 x 切割成两半，可以使用以下代码： ``` half = len(x) // 2 x1 = x[:half] x2 = x[half:] ``` 这里，`half` 是数组 `x` 的长度除以 2 的结果，也就是数组的中间位置。`x[:half]` 表示从数组的开头开始，取到 `half` 这个位置之前的所有元素，也就是数组的前一半；`x[half:]` 表示从数组的 `half` 位置开始，取到数组的末尾的所有元素，也就是数组的后一半。在你的代码中，如果你想把数组 `x` 切割成 `train_id` 和 `test_id` 两部分，可以使用以下代码： ``` train_x = x[train_id] test_x = x[test_id] ``` 这里，`train_id` 和 `test_id` 都是一维数组，它们的元素是数组 `x` 中被选中的元素的下标。`x[train_id]` 表示从数组 `x` 中取出下标为 `train_id` 中所有元素的值，这就是训练集；`x[test_id]` 同理，表示从数组 `x` 中取出下标为 `test_id` 中所有元素的值，这就是测试集。

修改和补充下列代码得到十折交叉验证的平均每一折auc值和平均每一折aoc曲线，平均每一折分类报告以及平均每一折混淆矩阵 min_max_scaler = MinMaxScaler() X_train1, X_test1 = x[train_id], x[test_id] y_train1, y_test1 = y[train_id], y[test_id] # apply the same scaler to both sets of data X_train1 = min_max_scaler.fit_transform(X_train1) X_test1 = min_max_scaler.transform(X_test1) X_train1 = np.array(X_train1) X_test1 = np.array(X_test1) config = get_config() tree = gcForest(config) tree.fit(X_train1, y_train1) y_pred11 = tree.predict(X_test1) y_pred1.append(y_pred11 X_train.append(X_train1) X_test.append(X_test1) y_test.append(y_test1) y_train.append(y_train1) X_train_fuzzy1, X_test_fuzzy1 = X_fuzzy[train_id], X_fuzzy[test_id] y_train_fuzzy1, y_test_fuzzy1 = y_sampled[train_id], y_sampled[test_id] X_train_fuzzy1 = min_max_scaler.fit_transform(X_train_fuzzy1) X_test_fuzzy1 = min_max_scaler.transform(X_test_fuzzy1) X_train_fuzzy1 = np.array(X_train_fuzzy1) X_test_fuzzy1 = np.array(X_test_fuzzy1) config = get_config() tree = gcForest(config) tree.fit(X_train_fuzzy1, y_train_fuzzy1) y_predd = tree.predict(X_test_fuzzy1) y_pred.append(y_predd) X_test_fuzzy.append(X_test_fuzzy1) y_test_fuzzy.append(y_test_fuzzy1)y_pred = to_categorical(np.concatenate(y_pred), num_classes=3) y_pred1 = to_categorical(np.concatenate(y_pred1), num_classes=3) y_test = to_categorical(np.concatenate(y_test), num_classes=3) y_test_fuzzy = to_categorical(np.concatenate(y_test_fuzzy), num_classes=3) print(y_pred.shape) print(y_pred1.shape) print(y_test.shape) print(y_test_fuzzy.shape) # 深度森林 report1 = classification_report(y_test, y_prprint("DF",report1) report = classification_report(y_test_fuzzy, y_pred) print("DF-F",report) mse = mean_squared_error(y_test, y_pred1) rmse = math.sqrt(mse) print('深度森林RMSE:', rmse) print('深度森林Accuracy:', accuracy_score(y_test, y_pred1)) mse = mean_squared_error(y_test_fuzzy, y_pred) rmse = math.sqrt(mse) print('F深度森林RMSE:', rmse) print('F深度森林Accuracy:', accuracy_score(y_test_fuzzy, y_pred)) mse = mean_squared_error(y_test, y_pred) rmse = math.sqrt(mse)

首先，需要将代码放入循环中进行十折交叉验证。每一折都需要记录相应的分类报告、混淆矩阵、auc值和aoc曲线。以下是修改后的代码： ``` from sklearn.preprocessing import MinMaxScaler from sklearn.metrics import classification_report, confusion_matrix, roc_auc_score, roc_curve, auc from sklearn.model_selection import StratifiedKFold from gcforest.gcforest import GCForest import numpy as np import math min_max_scaler = MinMaxScaler() config = get_config() tree = gcForest(config) X_train = [] X_test = [] y_train = [] y_test = [] X_test_fuzzy = [] y_test_fuzzy = [] y_pred = [] y_pred1 = [] auc_scores = [] aoc_fprs = [] aoc_tprs = [] skf = StratifiedKFold(n_splits=10) for train_id, test_id in skf.split(x, y): X_train1, X_test1 = x[train_id], x[test_id] y_train1, y_test1 = y[train_id], y[test_id] X_train1 = min_max_scaler.fit_transform(X_train1) X_test1 = min_max_scaler.transform(X_test1) X_train1 = np.array(X_train1) X_test1 = np.array(X_test1) tree.fit(X_train1, y_train1) y_pred11 = tree.predict(X_test1) y_pred1.append(y_pred11) X_train.append(X_train1) X_test.append(X_test1) y_test.append(y_test1) y_train.append(y_train1) X_train_fuzzy1, X_test_fuzzy1 = X_fuzzy[train_id], X_fuzzy[test_id] y_train_fuzzy1, y_test_fuzzy1 = y_sampled[train_id], y_sampled[test_id] X_train_fuzzy1 = min_max_scaler.fit_transform(X_train_fuzzy1) X_test_fuzzy1 = min_max_scaler.transform(X_test_fuzzy1) X_train_fuzzy1 = np.array(X_train_fuzzy1) X_test_fuzzy1 = np.array(X_test_fuzzy1) tree.fit(X_train_fuzzy1, y_train_fuzzy1) y_predd = tree.predict(X_test_fuzzy1) y_pred.append(y_predd) X_test_fuzzy.append(X_test_fuzzy1) y_test_fuzzy.append(y_test_fuzzy1) y_pred_proba = tree.predict_proba(X_test1) auc_score = roc_auc_score(y_test1, y_pred_proba, multi_class='ovr') auc_scores.append(auc_score) fpr, tpr, _ = roc_curve(y_test1, y_pred_proba[:, 1]) aoc_fprs.append(fpr) aoc_tprs.append(tpr) y_pred = to_categorical(np.concatenate(y_pred), num_classes=3) y_pred1 = to_categorical(np.concatenate(y_pred1), num_classes=3) y_test = to_categorical(np.concatenate(y_test), num_classes=3) y_test_fuzzy = to_categorical(np.concatenate(y_test_fuzzy), num_classes=3) print(y_pred.shape) print(y_pred1.shape) print(y_test.shape) print(y_test_fuzzy.shape) # 深度森林 report1 = classification_report(y_test, y_pred1) print("DF Classification Report (10-Fold CV): \n", report1) report = classification_report(y_test_fuzzy, y_pred) print("DF-F Classification Report (10-Fold CV): \n", report) mse = mean_squared_error(y_test, y_pred1) rmse = math.sqrt(mse) print('DF RMSE (10-Fold CV):', rmse) print('DF Accuracy (10-Fold CV):', accuracy_score(y_test, y_pred1)) mse = mean_squared_error(y_test_fuzzy, y_pred) rmse = math.sqrt(mse) print('DF-F RMSE (10-Fold CV):', rmse) print('DF-F Accuracy (10-Fold CV):', accuracy_score(y_test_fuzzy, y_pred)) mean_auc = np.mean(auc_scores) print("Mean AUC (10-Fold CV): ", mean_auc) mean_fpr = np.mean(aoc_fprs, axis=0) mean_tpr = np.mean(aoc_tprs, axis=0) mean_auc = auc(mean_fpr, mean_tpr) print("Mean AOC (10-Fold CV): ", mean_auc) ``` 在修改后的代码中，`StratifiedKFold`函数被用来进行十折交叉验证。每一折的训练数据和测试数据都是通过`train_id`和`test_id`来确定的。在每一折的训练和测试之后，需要记录相应的分类报告、混淆矩阵、auc值和aoc曲线。最后，需要计算平均每一折的auc值和aoc曲线。

修改和补充下列代码得到十折交叉验证的平均auc值和平均aoc曲线，平均分类报告以及平均混淆矩阵 min_max_scaler = MinMaxScaler() X_train1, X_test1 = x[train_id], x[test_id] y_train1, y_test1 = y[train_id], y[test_id] # apply the same scaler to both sets of data X_train1 = min_max_scaler.fit_transform(X_train1) X_test1 = min_max_scaler.transform(X_test1) X_train1 = np.array(X_train1) X_test1 = np.array(X_test1) config = get_config() tree = gcForest(config) tree.fit(X_train1, y_train1) y_pred11 = tree.predict(X_test1) y_pred1.append(y_pred11 X_train.append(X_train1) X_test.append(X_test1) y_test.append(y_test1) y_train.append(y_train1) X_train_fuzzy1, X_test_fuzzy1 = X_fuzzy[train_id], X_fuzzy[test_id] y_train_fuzzy1, y_test_fuzzy1 = y_sampled[train_id], y_sampled[test_id] X_train_fuzzy1 = min_max_scaler.fit_transform(X_train_fuzzy1) X_test_fuzzy1 = min_max_scaler.transform(X_test_fuzzy1) X_train_fuzzy1 = np.array(X_train_fuzzy1) X_test_fuzzy1 = np.array(X_test_fuzzy1) config = get_config() tree = gcForest(config) tree.fit(X_train_fuzzy1, y_train_fuzzy1) y_predd = tree.predict(X_test_fuzzy1) y_pred.append(y_predd) X_test_fuzzy.append(X_test_fuzzy1) y_test_fuzzy.append(y_test_fuzzy1)y_pred = to_categorical(np.concatenate(y_pred), num_classes=3) y_pred1 = to_categorical(np.concatenate(y_pred1), num_classes=3) y_test = to_categorical(np.concatenate(y_test), num_classes=3) y_test_fuzzy = to_categorical(np.concatenate(y_test_fuzzy), num_classes=3) print(y_pred.shape) print(y_pred1.shape) print(y_test.shape) print(y_test_fuzzy.shape) # 深度森林 report1 = classification_report(y_test, y_prprint("DF",report1) report = classification_report(y_test_fuzzy, y_pred) print("DF-F",report) mse = mean_squared_error(y_test, y_pred1) rmse = math.sqrt(mse) print('深度森林RMSE:', rmse) print('深度森林Accuracy:', accuracy_score(y_test, y_pred1)) mse = mean_squared_error(y_test_fuzzy, y_pred) rmse = math.sqrt(mse) print('F深度森林RMSE:', rmse) print('F深度森林Accuracy:', accuracy_score(y_test_fuzzy, y_pred)) mse = mean_squared_error(y_test, y_pred) rmse = math.sqrt(mse) print('F?深度森林RMSE:', rmse) print('F?深度森林Accuracy:', accuracy_score(y_test, y_pred))

以下是修改和补充后的代码，实现了十折交叉验证的平均auc值和平均aoc曲线，平均分类报告以及平均混淆矩阵： ```python from sklearn.preprocessing import MinMaxScaler from sklearn.metrics import classification_report, confusion_matrix, roc_curve, auc from sklearn.model_selection import StratifiedKFold min_max_scaler = MinMaxScaler() X_train, X_test, y_train, y_test = [], [], [], [] X_train_fuzzy, X_test_fuzzy, y_train_fuzzy, y_test_fuzzy = [], [], [], [] y_pred, y_pred1 = [], [] y_pred_proba, y_pred_proba1 = [], [] config = get_config() tree = gcForest(config) skf = StratifiedKFold(n_splits=10) for train_id, test_id in skf.split(x, y): # split data and normalize X_train1, X_test1 = x[train_id], x[test_id] y_train1, y_test1 = y[train_id], y[test_id] X_train1 = min_max_scaler.fit_transform(X_train1) X_test1 = min_max_scaler.transform(X_test1) X_train1 = np.array(X_train1) X_test1 = np.array(X_test1) # train gcForest tree.fit(X_train1, y_train1) # predict on test set y_pred11 = tree.predict(X_test1) y_pred_proba11 = tree.predict_proba(X_test1) # append predictions and test data y_pred1.append(y_pred11) y_pred_proba1.append(y_pred_proba11) X_train.append(X_train1) X_test.append(X_test1) y_test.append(y_test1) y_train.append(y_train1) # split fuzzy data and normalize X_train_fuzzy1, X_test_fuzzy1 = X_fuzzy[train_id], X_fuzzy[test_id] y_train_fuzzy1, y_test_fuzzy1 = y_sampled[train_id], y_sampled[test_id] X_train_fuzzy1 = min_max_scaler.fit_transform(X_train_fuzzy1) X_test_fuzzy1 = min_max_scaler.transform(X_test_fuzzy1) X_train_fuzzy1 = np.array(X_train_fuzzy1) X_test_fuzzy1 = np.array(X_test_fuzzy1) # train gcForest on fuzzy data tree.fit(X_train_fuzzy1, y_train_fuzzy1) # predict on fuzzy test set y_predd = tree.predict(X_test_fuzzy1) y_predd_proba = tree.predict_proba(X_test_fuzzy1) # append predictions and test data y_pred.append(y_predd) y_pred_proba.append(y_predd_proba) X_test_fuzzy.append(X_test_fuzzy1) y_test_fuzzy.append(y_test_fuzzy1) # concatenate and convert to categorical y_pred = to_categorical(np.concatenate(y_pred), num_classes=3) y_pred1 = to_categorical(np.concatenate(y_pred1), num_classes=3) y_test = to_categorical(np.concatenate(y_test), num_classes=3) y_test_fuzzy = to_categorical(np.concatenate(y_test_fuzzy), num_classes=3) # calculate and print average accuracy and RMSE mse = mean_squared_error(y_test, y_pred1) rmse = math.sqrt(mse) print('深度森林RMSE:', rmse) print('深度森林Accuracy:', accuracy_score(y_test, y_pred1)) mse = mean_squared_error(y_test_fuzzy, y_pred) rmse = math.sqrt(mse) print('F深度森林RMSE:', rmse) print('F深度森林Accuracy:', accuracy_score(y_test_fuzzy, y_pred)) mse = mean_squared_error(y_test, y_pred) rmse = math.sqrt(mse) print('F?深度森林RMSE:', rmse) print('F?深度森林Accuracy:', accuracy_score(y_test, y_pred)) # calculate and print average classification report report1 = classification_report(y_test, y_pred1) print("DF", report1) report = classification_report(y_test_fuzzy, y_pred) print("DF-F", report) # calculate and print average confusion matrix cm1 = confusion_matrix(y_test.argmax(axis=1), y_pred1.argmax(axis=1)) cm = confusion_matrix(y_test_fuzzy.argmax(axis=1), y_pred.argmax(axis=1)) print('DF Confusion Matrix:') print(cm1) print('DF-F Confusion Matrix:') print(cm) # calculate and print average ROC curve and AUC value fpr1, tpr1, threshold1 = roc_curve(y_test.ravel(), y_pred_proba1.ravel()) fpr, tpr, threshold = roc_curve(y_test_fuzzy.ravel(), y_pred_proba.ravel()) roc_auc1 = auc(fpr1, tpr1) roc_auc = auc(fpr, tpr) print('DF ROC AUC:', roc_auc1) print('DF-F ROC AUC:', roc_auc) # plot average ROC curve plt.title('Receiver Operating Characteristic') plt.plot(fpr1, tpr1, 'b', label = 'DF AUC = %0.2f' % roc_auc1) plt.plot(fpr, tpr, 'g', label = 'DF-F AUC = %0.2f' % roc_auc) plt.legend(loc = 'lower right') plt.plot([0, 1], [0, 1],'r--') plt.xlim([0, 1]) plt.ylim([0, 1]) plt.ylabel('True Positive Rate') plt.xlabel('False Positive Rate') plt.show() ```

for train_id, test_id in skf.split(x, y): print("============{}-th cross validation============".format(i)) X_train, X_test = x[train_id,:], x[test_id,:]怎么变成slices

相关推荐

keras model.fit 解决validation_spilt=num 的问题

python中导入 train_test_split提示错误的解决

pd_split_train_test.rar_pandas_pandas对数据分类_pd.split_split_数据分类

怎么求十折分层抽样后的平均分类报告python不用cross_val_score

随机森林调参_随机森林调参实战（信用卡欺诈预测）

怎么求得分层抽样的每一层的classfication_report对应分类指标的平均值的非标类报告python代码

怎么求得分层抽样的每一层的每一行每一列的classfication_report对应分类指标的平均值的分类报告python代码

我应该怎么样才能将样本随机分成十分，然后跑数据

10折分层抽样，每一折有一个classification_report怎么求这10个classification report，求这10个放在一起求的平均的分类报告python代码

怎么利用k折交叉方法算auc请写出代码

Tensorflow==1.12 车牌识别项目.zip

Don-t-Get-Kicked:==吗？ 1个

train_test_x.rar

最新推荐

Python实现K折交叉验证法的方法步骤

模板059.pptx

全国各地电信铁通DNS服务器地址.doc

最新服务器双机热备解决方案.docx

服务器及存储高性能双机热备方案.docx

VMP技术解析：Handle块优化与壳模板初始化

管理建模和仿真的文件

【进阶】音频处理基础：使用Librosa

python中字典转换成json

C++ Primer 第四版更新：现代编程风格与标准库