Traceback (most recent call last): File "C:\Users\test\PycharmProjects\pythonProject\链接导入csv中.py", line 57, in <module> df_extracted = df_extracted.append({'File Name': file_name}, ignore_index=True) ^^^^^^^^^^^^^^^^^^^ File "C:\Users\test\PycharmProjects\pythonProject\venv\Lib\site-packages\pandas\core\generic.py", line 5989, in getattr return object.getattribute(self, name) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ AttributeError: 'DataFrame' object has no attribute 'append'. Did you mean: '_append'?

时间: 2023-08-05 15:05:41 浏览: 42

抱歉，我之前的回答有误。在最新版本的pandas中，DataFrame对象没有`append`方法。你可以使用`concat`函数来将提取的数据追加到DataFrame中。以下是修改后的示例代码： ```python import os from bs4 import BeautifulSoup import re import pandas as pd # 指定文件夹路径 folder_path = "C:/Users/test/Desktop/DIDItest" # 正则表达式模式 pattern = r'<body>(.*?)<\/body>' # 创建一个空的DataFrame用于存储提取的文件名数据 df_extracted = pd.DataFrame(columns=['File Name']) # 遍历文件夹中的所有文件 for root, dirs, files in os.walk(folder_path): for file in files: # 读取html文件 file_path = os.path.join(root, file) with open(file_path, "r", encoding="utf-8-sig") as f: html_code = f.read() # 创建BeautifulSoup对象 soup = BeautifulSoup(html_code, 'html.parser') # 使用正则表达式匹配<body>标签内的数据 body_data = re.findall(pattern, html_code, re.DOTALL) # 剔除<p>和()</p> body_data = body_data[0].replace("<p>", "").replace("()</p>", "") # 使用正则表达式提取文件名 matches = re.findall(r'"([^"]*)"', body_data) if matches: file_name = matches[0] else: file_name = "No matches found." # 替换字符 file_name = file_name.replace('No matches found.', '') # 将提取的文件名数据加入DataFrame中 df_extracted = pd.concat([df_extracted, pd.DataFrame({'File Name': [file_name]})], ignore_index=True) # 读取原有的CSV文件 csv_file = "output.csv" df_original = pd.read_csv(csv_file) # 合并原有的DataFrame和提取的DataFrame df_combined = pd.concat([df_original, df_extracted], axis=1) # 将合并后的DataFrame写入CSV文件 df_combined.to_csv(csv_file, index=False) print("数据已成功写入CSV文件") ``` 在上面的代码中，我使用`pd.concat()`函数将提取的文件名数据追加到`df_extracted`中。然后，我将原始的DataFrame `df_original` 和提取的DataFrame `df_extracted` 合并为一个新的DataFrame `df_combined`。最后，使用`df_combined.to_csv()`将合并后的DataFrame写入CSV文件。请确保提供正确的文件夹路径和CSV文件路径，并根据需要调整代码。再次对之前的错误回答表示歉意。

相关推荐

Mac下anaconda中Spyder打开报错Traceback(most recent call last)…问题

python-unit-test-tool-comparison:比较Python unittest，nose和py.test

Python程序基础：Python中的异常.pptx

Traceback (most recent call last): File "C:\Users\test\PycharmProjects\pythonProject\DIDI数据写入CSV.py", line 65, in <module> file.close() ^^^^ NameError: name 'file' is not defined. Did you mean: 'files'?

Traceback (most recent call last): File "C:\Users\test\PycharmProjects\pythonProject\DIDI数据写入CSV.py", line 40, in <module> if '' in content: ^^^^^^^ NameError: name 'content' is not defined

Traceback (most recent call last): File "C:\Users\test\PycharmProjects\pythonProject\DIDI数据写入CSV.py", line 63, in <module> writer.writerow([talk_id, time, send_id, receive_id, talk_type]) ValueError: I/O operation on closed file.

Traceback (most recent call last): File "C:\Users\test\PycharmProjects\pythonProject\DIDI数据写入CSV.py", line 38, in <module> print("talkid:", talk_id) ^^^^^^^ NameError: name 'talk_id' is not defined

Traceback (most recent call last): File "C:\Users\test\PycharmProjects\pythonProject\DIDI数据写入CSV.py", line 82, in <module> writer.writerow([talk_id, time, send_id, receive_id, talk_type, content]) ^^^^^^^ NameError: name 'talk_id' is not defined

Traceback (most recent call last): File "C:\Users\test\PycharmProjects\pythonProject\DIDI数据写入CSV.py", line 26, in <module> talk_id = message.find_previous('a').text.strip()[1:] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ AttributeError: 'NoneType' object has no attribute 'text'

Traceback (most recent call last): File "C:\Users\test\PycharmProjects\pythonProject\DIDI数据写入CSV.py", line 32, in <module> content = message.find('a').get('href') if message.find('a') else message.contents[-1].strip() ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ TypeError: 'NoneType' object is not callable

Traceback (most recent call last): File "C:\Users\test\PycharmProjects\pythonProject\DIDI_test1.py", line 39, in <module> content = re.findall(r'', match[5])[0] if type in ['音频', '图片'] else match[5] ~~~~~^^^ IndexError: tuple index out of range

Traceback (most recent call last): File "C:\Users\test\PycharmProjects\pythonProject\DIDI_test1.py", line 37, in <module> content = re.findall(r'', match[5])[0] if type in ['音频', '图片'] else match[5] ~~~~~^^^ IndexError: tuple index out of range

最新推荐

基于AT89C51单片机的三电梯联动控制系统+全部资料+详细文档（高分项目）.zip

梯度下降算法：介绍梯度下降算法 实例说明其运行原理

node-v4.8.0-linux-ppc64.tar.xz

zigbee-cluster-library-specification

管理建模和仿真的文件

实现实时数据湖架构：Kafka与Hive集成

list根据id查询pid 然后依次获取到所有的子节点数据

JSBSim Reference Manual

"互动学习：行动中的多样性与论文攻读经历"

实现实时监控告警系统：Kafka与Grafana整合

梯度下降算法：介绍梯度下降算法实例说明其运行原理