df = pd.read_table(path, skiprows=10, delim_whitespace=True, names=f[9].split()[2:])

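This Python snippet uses the pandas library to read a text file. path is the file path. skiprows=10 skips the first 10 lines. delim_whitespace=True splits fields on any run of whitespace (spaces or tabs); recent pandas releases deprecate it in favor of sep=r"\s+". names=f[9].split()[2:] takes the element f[9] (the 10th item of f, since indexing is 0-based, presumably the 10th line of the file), splits it on whitespace, and uses every token from the third onward as the column names.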
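The call above can be sketched on in-memory text. This is a minimal sketch, assuming f holds the lines of the same file (for example from readlines()) and that the 10th line is a header whose first two tokens are not column names. The sample text and the column names time, temp, and pressure are made up for illustration, and sep=r"\s+" is used in place of the deprecated delim_whitespace=True.

```python
import io

import pandas as pd

# Hypothetical file content: 9 lines of preamble, a header line whose first
# two tokens ("#" and "Columns:") are not column names, then the data rows.
preamble = [f"# comment line {i}\n" for i in range(1, 10)]
header = "# Columns: time temp pressure\n"
rows = ["0.0 20.1 1013\n", "1.0  20.4\t1012\n"]
text = "".join(preamble + [header] + rows)

# f plays the role of the list of lines used in the original snippet.
f = text.splitlines()

# f[9] is the 10th line (0-based index 9); split() breaks it on any
# whitespace and [2:] drops the first two tokens.
names = f[9].split()[2:]

df = pd.read_table(
    io.StringIO(text),  # stands in for `path`
    skiprows=10,        # skip the preamble and the header line
    sep=r"\s+",         # any run of spaces or tabs separates fields
    names=names,
)
print(df)
```

Because names is passed explicitly, pandas does not look for a header row in the remaining lines, which is why the header line itself has to be among the skipped rows.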

train_output = pd.read_excel('test.xlsx', nrows=100, usecols=[36])
test_output = pd.read_excel('test.xlsx', skiprows=99, nrows=50, usecols=[36])

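Hello! Your code uses the pandas read_excel function to read data from an Excel file. The nrows parameter sets how many rows to read, and usecols selects which columns to read. Here you read the file 'test.xlsx' and keep only the column at position 36; positions are 0-based, so this is the 37th column. For the training data, nrows=100 reads the first 100 data rows after the header row. For the test data, skiprows=99 skips the first 99 rows of the sheet, header row included, so the 100th sheet row (data row 99) is consumed as the header and the 50 rows returned are data rows 100 through 149. Data row 100 therefore appears in both train_output and test_output, and the column name of test_output is a data value rather than the real header. If the intent is a clean split, skiprows=range(1, 101) keeps the header and skips exactly the 100 training rows. Both calls return DataFrames: train_output holds the training data and test_output holds the test data. Make sure the file path and file name are correct and that pandas is installed (reading .xlsx files also needs an Excel engine such as openpyxl). If the file does not exist or the path is wrong, an error is raised.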
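The effect of an integer skiprows on the header row can be checked with a small experiment. This is a sketch under stated assumptions: it uses pd.read_csv on in-memory text as a stand-in for the Excel file, since read_csv accepts the same skiprows, nrows, and usecols parameters and avoids needing an Excel engine. The single column named value and its contents 1 to 150 are made up for illustration.

```python
import io

import pandas as pd

# Hypothetical sheet: one header row followed by 150 data rows numbered 1..150.
text = "value\n" + "".join(f"{i}\n" for i in range(1, 151))

# Same pattern as train_output: header row, then the first 100 data rows.
train = pd.read_csv(io.StringIO(text), nrows=100, usecols=[0])

# Same pattern as test_output: skiprows=99 drops the header plus data rows
# 1..98, so data row 99 becomes the header and the frame starts at row 100.
test_overlap = pd.read_csv(io.StringIO(text), skiprows=99, nrows=50, usecols=[0])

# Skipping lines 1..100 (0-based) keeps the header and drops exactly the
# 100 data rows already used for training.
test_clean = pd.read_csv(
    io.StringIO(text), skiprows=range(1, 101), nrows=50, usecols=[0]
)

print(train.tail(1))
print(test_overlap.head(1))
print(test_clean.head(1))
```

Passing a range (or list) of row numbers to skiprows leaves row 0 in place, so the real header survives and the two subsets no longer share a row.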

Code error:

    import pandas as pd

    # Read the first two rows of data from the first Excel workbook
    df1 = pd.read_excel('附件一：2022年12月末存量公司客户管户认领情况表海沧-1.xlsx', header=[0, 1], nrows=2)

    # Define an empty DataFrame to hold the data from all the Excel workbooks
    result = pd.DataFrame()

    # Loop over the other Excel workbooks, skip the first two rows, and append to result
    for file in [
        '附件三：2022年12月末存量机构客户管户认领情况表海沧支行-1.xlsx',
        '附件二：2023年3月末新拓公司客户管户认领情况表海沧-1.xlsx',
        '附件四：2023年3月末新拓机构客户管户认领情况表海沧支行-1.xlsx'
    ]:
        df = pd.read_excel(file, header=[0, 1], skiprows=2)
        result = pd.concat([result, df], ignore_index=True)

    # Combine the first two rows of the first workbook with the concatenated data
    df = pd.concat([df1, result], ignore_index=True)
    df = df.reset_index()

    # Write to a new Excel workbook
    df.to_excel('new_file.xlsx', index=False)

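It looks like the error persists after you added reset_index(). One possible cause is that the column or row labels of your DataFrame contain special characters such as spaces, colons, or slashes, which can cause problems in Excel. Note also that with header=[0, 1] the column labels are tuples (a MultiIndex), and pandas has historically raised NotImplementedError when asked to write MultiIndex columns with index=False. You can try flattening the column labels and replacing the special characters before writing to Excel. Example code:

```python
import pandas as pd

# Read the first two rows of data from the first Excel workbook
df1 = pd.read_excel('附件一：2022年12月末存量公司客户管户认领情况表__海沧-1.xlsx', header=[0, 1], nrows=2)

# Define an empty DataFrame to hold the data from all the Excel workbooks
result = pd.DataFrame()

# Loop over the other Excel workbooks, skip the first two rows, and append to result
for file in [
    '附件三：2022年12月末存量机构客户管户认领情况表__海沧支行-1.xlsx',
    '附件二：2023年3月末新拓公司客户管户认领情况表__海沧-1.xlsx',
    '附件四：2023年3月末新拓机构客户管户认领情况表__海沧支行-1.xlsx'
]:
    df = pd.read_excel(file, header=[0, 1], skiprows=2)
    result = pd.concat([result, df], ignore_index=True)

# Combine the first two rows of the first workbook with the concatenated data
df = pd.concat([df1, result], ignore_index=True)


def clean(label):
    # Labels may be non-strings (for example numbers), so convert first
    return str(label).replace(' ', '_').replace(':', '_').replace('/', '_')


# With header=[0, 1] each column label is a tuple: clean every level and
# join the levels into one flat string
df.columns = [
    '_'.join(clean(level) for level in col) if isinstance(col, tuple) else clean(col)
    for col in df.columns
]

# Write to a new Excel workbook
df.to_excel('new_file.xlsx', index=False)
```

This code replaces spaces, colons, and slashes in the column labels with underscores and joins the two header levels into a single label. The row labels are left alone: after ignore_index=True the index consists of integers, which have no replace method and need no cleaning. Adjust the replaced characters to suit your data.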
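The column-label handling can be tried without any Excel file. This is a minimal sketch, assuming a two-level header of the kind returned by pd.read_excel(..., header=[0, 1]); the labels and the single data row are made up for illustration, and clean is a hypothetical helper name.

```python
import pandas as pd

# Hypothetical two-level header, shaped like the result of
# pd.read_excel(..., header=[0, 1]); the labels are made up.
columns = pd.MultiIndex.from_tuples([
    ("Customer info", "Name"),
    ("Customer info", "Industry/Type"),
    ("Account manager", "Claim: status"),
])
df = pd.DataFrame([["Acme", "Retail", "claimed"]], columns=columns)


def clean(label):
    """Replace spaces, colons, and slashes in one label with underscores."""
    return str(label).replace(" ", "_").replace(":", "_").replace("/", "_")


# Each column label is a tuple such as ("Customer info", "Name");
# clean every level, then join the levels into one flat string.
df.columns = ["_".join(clean(level) for level in col) for col in df.columns]

print(list(df.columns))
```

Calling .replace directly on a column label fails here because a tuple has no such method, which is why each level is cleaned separately before joining.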
